TREATMENT OF PAIN THROUGH EXPRESSION OF OPIOID RECEPTORS

[01] This application claims priority of United States Provisional Application No. 
60/448,663, filed on February 19, 2003, which is herein incorporated by reference in its entirety. 

I. BACKGROUND

[02] Tissue injury and nerve damage caused by trauma, infection, arthritis or iatrogenic
procedures can produce inflammation, spontaneous pain and hyperalgesia (Levine JD and Taiwo
YO, Anesth Prog 1990;37:133-35; Goelet P, et al., Nature. 1986;322(6078):419-22). Furthermore,
patients affected by temporomandibular joint and/or masticatory muscle (orofacial) pain often suffer
because dental, surgical and/or pharmacologic therapies do not consistently give adequate symptom
relief. In fact, according to Public Health Services (PHS) estimates, there are more than 50 million
Americans who experience chronic pain with 45% seeking medical care at some point in their lives.
It is also estimated that 40% of pain patients never receive adequate relief. Lipton et al. (Lipton JA,
et al., JADA 1993;124:115-21) reported that 22% of the population in the United States experienced
at least one episode of orofacial pain in the last six months. It is estimated that approximately $80
billion is spent annually to treat pain and that 40% of that is to treat craniofacial pain (Bonica JJ.
Preface et al., eds. Advances in Pain Research Therapy, Vol 3. New York: Raven Press; 1973:v-vii).
To date, pharmacological approaches still dominate the clinical pain arena, with only modest
efforts being directed towards the development of new innovative treatment regimes for the
management of pain. Disclosed are vectors and methods for reducing pains, such as myalgic and
arthralgic pains, and such as those in the orofacial region.

II. SUMMARY

[03] As embodied and broadly described herein, disclosed, in one aspect, are
vector constructs that comprise a sequence encoding a polypeptide for treating pain. Also disclosed
are methods for treating pain by expressing the μ-opioid receptor protein in nerve cells.

[04] It is to be understood that both the foregoing general description and the following
detailed description are exemplary and explanatory only and are not restrictive.

III. BRIEF DESCRIPTION OF THE DRAWINGS

[05] Figure 1 shows transient transfection of the human μ-opioid receptor in N2a cells.
Figure 1A shows that immunocytochemistry using a monoclonal antibody reveals expression of HUMOR
driven by the cytomegalovirus (CMV) promoter in neuronal cells. Figure 1B shows mock
transfected cells.
[05] Figure 2 shows FIV(lacZ) local administration to the temporomandibular joint
(TMJ) area of the face. Figure 2A shows that FIV was injected, at 10* infectious
particles per mL, into the right TMJ, which receives sensory innervation from the mandibular
division of the trigeminal nerve. Sagittal (B) and horizontal (C) sections of the right gasserian
(trigeminal) ganglion display X-gal positive neuronal cell bodies that were transduced following
FIV(lacZ) injection in the TMJ.

[06] Figure 3 shows a representation of the 3-vector FIV(HUMOR) lentiviral system
containing the HUMOR gene. The FIV(HUMOR) lentiviral system is comprised
of 3 vectors: a packaging vector providing the packaging instructions in trans; a VSV-G envelope
vector (VSV-G sequence in SEQ ID NO:54) providing the envelope instructions in trans; and
the FIV(HUMOR) vector containing the therapeutic gene.

[07] Figure 4 shows a schematic of an exemplary LTV vector carrying a HUMOR
cassette.

[08] Figure 5 shows that FIV(lacZ) injection (a total of 5X10^ infectious particles) to the
right TMJ resulted in widespread infection of hard as well as soft tissues of the joint. (A)
Sagittal TMJ sections analyzed by β-galactosidase immunohistochemistry and counter-stained
by nuclear fast red revealed expression of the reporter gene lacZ in the hypertrophic zone of the
condyle, primarily comprised of cartilaginous cells, (B) as well as in the meniscus, endothelial
cells and perivascular osteocytes. Panel (C) depicts TMJ sections from a saline injected animal.
c=condyle; d=disk; m=muscle; v=vessel.

[09] Figure 6 shows the development of the control FIV(A'lac) vector with inactive
β-galactosidase gene. (A) The reporter gene lacZ was inactivated after deletion of a critical placZ
DNA fragment containing the β-galactosidase gene transcription initiation site by restriction
enzyme-mediated excision and re-ligation of the backbone vector. (B) The structures of the mutated
FIV(A'lac) and wild type FIV(lacZ) viral vectors were confirmed by PCR following transient
transfection into the murine cell line NIH 3T3. The presence of viral DNA in cells was detected
by a 444 bp DNA band utilizing the "FIV" primers (as depicted in panel A). The complete
structure of the lacZ gene was confirmed by a 1.7 kb DNA band utilizing the lacZ primers (depicted
as UP, LP in panel A). In the case of the mutated FIV(A'lac), the 1.7 kb DNA
band was absent because the annealing site for the lower primer LP was excised. (C) Deletion of the
lacZ transcription initiation sequence in the FIV(A'lac) resulted in inactivation of the β-galactosidase
reporter gene as demonstrated by the lack of X-gal staining compared to (D) the FIV(lacZ)
vector.

[10] Figure 7 shows that FIV(lacZ) and FIV(A'lac) injections (5X10^ infectious particles)
in the right TMJ of mice resulted in successful infection of primary sensory neurons located in
the ipsilateral trigeminal ganglion. The animals' left side TMJ was not treated. (A) The presence
of backbone FIV DNA in the right trigeminal ganglia ipsilateral to FIV injections was detected
by a 444 bp DNA band in lanes 1 and 3, utilizing the "FIV" primers (as depicted in panel A),
suggesting successful transduction of the trigeminal sensory neurons by FIV vectors. Lanes 2
and 4 do not display any viral DNA as they represent left side ganglia. (B) The inactive form of
the β-galactosidase gene in transduced neurons was detected by the absence of the 1.7 kb DNA
band (lane 1) compared with the wild type gene (lane 3). Lanes 2 and 4 do not display any viral
DNA as they represent left side ganglia. (C) The successful extraction of genomic DNA from
left and right ganglia was confirmed by PCR utilizing primers designed for the murine
housekeeping gene G3PDH (385 bp).

[11] Figure 8 shows that injection of FIV(lacZ) in the right TMJ (5X10^ infectious
particles) resulted in successful transduction of primary sensory neurons with the reporter gene
β-galactosidase in trigeminal ganglia ipsilateral to the treated joint. (A) β-galactosidase
expression was detected by X-gal histochemistry in sagittal sections of right-side ganglion (4X),
(B) primarily at its posterior and posterolateral region (20X). (C) Injection of FIV(A'lac) did not
result in β-galactosidase expression. (D) The X-gal staining was confirmed with
immunocytochemistry employing antibodies raised against bacterial β-galactosidase following
FIV(lacZ) injection compared to (E) FIV(A'lac) treatment.

[12] Figure 9 shows that the neuronal cell line N2a was infected with HIV(HUMOR) [a
lentivirus]. Total RNA was extracted from infected and control cells. The levels of HUMOR
and G3PDH transcript were assessed by RT-PCR. Minimal amounts of HUMOR were detected
in naive cells (C1, C2), as well as cells infected with the HIV(lacZ) virus (L1, L2). In contrast,
HUMOR was readily detected in cells infected with HIV(HUMOR). As a control, the
housekeeping gene G3PDH transcript was detected in all samples analyzed.
IV. DETAILED DESCRIPTION

[13] Before the present compounds, compositions, articles, devices, and/or methods
are disclosed and described, it is to be understood that the disclosure is not limited to specific
synthetic methods or specific recombinant biotechnology methods unless otherwise specified, or
to particular reagents unless otherwise specified, as such may, of course, vary. It is also to be
understood that the terminology used herein is for the purpose of describing particular
embodiments only and is not intended to be limiting.

[14] Disclosed are the components to be used to prepare the disclosed compositions
as well as the compositions themselves to be used within the methods disclosed herein. These
and other materials are disclosed herein, and it is understood that when combinations, subsets,
interactions, groups, etc. of these materials are disclosed that while specific reference of each
various individual and collective combinations and permutation of these compounds may not be
explicitly disclosed, each is specifically contemplated and described herein. For example, if a
particular μ-opioid receptor vector is disclosed and discussed and a number of modifications
that can be made to a number of molecules including the μ-opioid receptor vector are discussed,
specifically contemplated is each and every combination and permutation of the μ-opioid
receptor vector and the modifications that are possible unless specifically indicated to the
contrary. Thus, if a class of molecules A, B, and C are disclosed as well as a class of molecules
D, E, and F and an example of a combination molecule, A-D is disclosed, then even if each is
not individually recited each is individually and collectively contemplated meaning
combinations, A-E, A-F, B-D, B-E, B-F, C-D, C-E, and C-F are considered disclosed.
Likewise, any subset or combination of these is also disclosed. Thus, for example, the sub-group
of A-E, B-F, and C-E would be considered disclosed. This concept applies to all aspects
of this application including, but not limited to, steps in methods of making and using the
disclosed compositions. Thus, if there are a variety of additional steps that can be performed it
is understood that each of these additional steps can be performed with any specific embodiment
or combination of embodiments of the disclosed methods.
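
The combinatorial reading of the A-F example above can be illustrated with a short sketch. It is offered only as an illustration, assuming the placeholder class labels A, B, C and D, E, F used in the text; the function name `pairwise_combinations` is hypothetical and is not part of the disclosure.

```python
from itertools import product

# Placeholder class labels taken from the example in the text.
class_one = ["A", "B", "C"]
class_two = ["D", "E", "F"]


def pairwise_combinations(first, second):
    """Return every first-second pairing as a label such as 'A-D'."""
    return [f"{x}-{y}" for x, y in product(first, second)]


all_pairs = pairwise_combinations(class_one, class_two)

# The sub-group named in the text is one subset of the full enumeration.
sub_group = {"A-E", "B-F", "C-E"}
is_covered = sub_group.issubset(all_pairs)

print(all_pairs)
print(is_covered)
```

Under these assumptions the enumeration contains the recited example A-D together with the eight further pairings listed in the text, and any sub-group such as A-E, B-F, and C-E is a subset of it.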
A. Definitions 

[15] As used in the specification and the appended claims, the singular forms "a,"
"an" and "the" include plural referents unless the context clearly dictates otherwise. Thus, for
example, reference to "a pharmaceutical carrier" includes mixtures of two or more such carriers,
and the like.
[16] Ranges can be expressed herein as from "about" one particular value, and/or to
"about" another particular value. When such a range is expressed, another embodiment includes
from the one particular value and/or to the other particular value. Similarly, when values are
expressed as approximations, by use of the antecedent "about," it will be understood that the
particular value forms another embodiment. It will be further understood that the endpoints of
each of the ranges are significant both in relation to the other endpoint, and independently of the
other endpoint. It is also understood that there are a number of values disclosed herein, and that
each value is also herein disclosed as "about" that particular value in addition to the value itself.
For example, if the value "10" is disclosed, then "about 10" is also disclosed. It is also
understood that when a value is disclosed that "less than or equal to" the value, "greater than or
equal to the value" and possible ranges between values are also disclosed, as appropriately
understood by the skilled artisan. For example, if the value "10" is disclosed then "less than or
equal to 10" as well as "greater than or equal to 10" as well as "less than" and "greater than" 10
are also disclosed. It is also understood that throughout the application, data is provided in a
number of different formats, and that this data represents endpoints and starting points, and
ranges for any combination of the data points. For example, if a particular data point "10" and a
particular data point "15" are disclosed, it is understood that greater than, greater than or equal to,
less than, less than or equal to, and equal to 10 and 15 are considered disclosed as well as
between 10 and 15.

[17] In this specification and in the claims which follow, reference will be made to a
number of terms which shall be defined to have the following meanings:

[18] "Optional" or "optionally" means that the subsequently described event or
circumstance may or may not occur, and that the description includes instances where said event
or circumstance occurs and instances where it does not.

[19] "Primers" are a subset of probes which are capable of supporting some type of
enzymatic manipulation and which can hybridize with a target nucleic acid such that the
enzymatic manipulation can occur. A primer can be made from any combination of nucleotides
or nucleotide derivatives or analogs available in the art which do not interfere with the
enzymatic manipulation.

[20] "Probes" are molecules capable of interacting with a target nucleic acid, typically
in a sequence specific manner, for example through hybridization. The hybridization of nucleic
acids is well understood in the art and discussed herein. Typically a probe can be made from
any combination of nucleotides or nucleotide derivatives or analogs available in the art.

[21] Throughout this application, various publications are referenced. The disclosures
of these publications in their entireties are hereby incorporated by reference into this application
in order to more fully describe the state of the art to which this invention pertains. The
references disclosed are also individually and specifically incorporated by reference herein at
least for the material contained in them that is discussed in the sentence in which the reference
is relied upon.

B. Compositions and methods

[22] The μ-opioid receptor (HUMOR) is a key component of the intrinsic anti-nociceptive
pathway in mammals: descending bulbospinal serotonergic and noradrenergic
neuronal projections exert anti-nociceptive effects via release of endogenous opioids, which in
turn activate μ-opioid receptors present on the presynaptic membrane of the primary sensory
neurons. Pain stimulus travels through the nerve to the brain through activation of nociceptors.
The activation of μ-opioid receptors through binding of opioids interrupts the transmission of
the pain signal. Mammals release endogenous opioids when under pain assault and billions are
spent each year in the pharmaceutical industry to treat pain through the administration of opioids
and opioid like molecules that target the μ-opioid receptors. Disclosed herein are compositions
and methods for the treatment of pain, which do not require the administration of μ-opioid
receptor targeted molecules or utilize lower effective amounts of opioid receptor targeted
molecules. The disclosed methods involve the over-expression of μ-opioid receptors, which can
make the nerve cell more receptive to endogenous opioid molecules or to opioids or opioid like
molecules administered as a pharmaceutical. Over-expression of the μ-opioid receptors can
occur through stimulation of endogenous opioid receptor genes or through transgenic therapy
that delivers a construct encoding the opioid receptor.

[23] Receptor up-regulation is designed to circumvent the observed
desensitization following prolonged opioid drug administration, which in part occurs as a
decrease in receptor expression. Furthermore, the strategy can take advantage of the existing
intrinsic anti-nociceptive mechanism by ensuring adequate μ-opioid receptor presence at the site
of the central processing of pain. This adequate receptor presence is consistent with heightened
sensitivity of patients to drugs administered exogenously, which in turn is consistent with requiring
smaller doses of opioid analgesics, such as Ultram, Fentanyl, and Darvon, which otherwise
commonly result in pathologic addiction.

[24] Disclosed are compositions and methods to target the expression of opioid
receptors, such as the μ-opioid receptors, such as human opioid receptors, to sensory neurons
innervating regions that can experience pain, such as orofacial regions that experience
nociception. Disclosed are compositions and methods for targeting expression of opioid receptors,
such as the μ-opioid receptors, in sensory orofacial neurons. Also disclosed are compositions
and methods for targeting expression of opioid receptors, such as the μ-opioid receptors, in any
sensory neuron. For example, the compositions and methods can be used in any sensory neuron,
wherein the sensory neuron processes pain or other "input" signals from peripheral tissues (e.g.,
joints, amputated limbs, extracted or endodontically treated teeth), as well as vital organs. For
example, disclosed are vectors, such as feline immunodeficiency lentiviral vectors (FIV), rAAV
vectors, HSV Amplicon, and liposomes for delivery of the opioid receptor DNA.
Administration of the vectors peripherally to infect those sensory neurons, such as those
innervating the orofacial region, can be performed. For example, the vectors can be delivered at
the point of pain, for example, an extremity, by, for example, injection into the extremity.
Disclosed are vectors, such as FIV, rAAV, HSV Amplicon, and liposomes, capable of stably
transducing terminally differentiated cells, including neurons.

a) Nervous system 

[25] The nervous system can be divided into two parts: central and peripheral. The
central nervous system consists of the encephalon or brain and the medulla spinalis or spinal
cord. These two parts, the brain and the spinal cord, are continuous with one another at the level
of the upper border of the atlas vertebra. The peripheral nervous system consists of a series of
nerves, which connect the central nervous system to all of the tissues in the body. Nerves also
are often grouped as cerebrospinal and sympathetic. However, since the two groups are
intimately connected and closely intermingled these distinctions are not absolute. Nerve cells
can also be classified as efferent or afferent nerves. Efferent nerve cells are nerve cells that
transmit signals from the brain to the periphery and afferent nerve cells are nerve cells that
transmit signals from the periphery to the brain.

[26] Neurons act as pain pathways and these pathways include peripheral, spinal, and
supraspinal elements. The peripheral part of the system includes the primary afferent sensory
neurons. These neurons are called nociceptors, and can be found throughout the body, such as
in the skin, muscle, connective tissue, the cardiac system, and abdominal and thoracic viscera.
Nociceptors are uncapsulated nerve endings that detect thermal, mechanical, or chemical
stimuli, and are thus not small molecule receptors. Nociceptors can be thinly myelinated or
unmyelinated nerve fibers. The thinly myelinated variety are termed A-delta fibers and the
unmyelinated variety are termed C-polymodal fibers. The primary functional difference
between A-delta and C-polymodal fibers is that A-delta fibers are rapidly conducting and
C-polymodal fibers are slowly conducting. This means that A-delta fibers transmit sensations
perceived as fast, sharp, well-localized pricking pain, and C-polymodal fibers, which respond to
thermal, mechanical, and chemical stimuli, transmit sensations perceived as dull, aching, burning,
poorly localized pain.

[27] Most A-delta and C-polymodal afferent fibers enter the dorsal horn of the
spinal cord by way of the dorsal nerve roots and their ganglia. Wide dynamic range neurons
receive nociceptive and non-nociceptive input from the skin, muscle, and viscera. This
convergence can account for visceral referred pain. Impulses are then transmitted to the brain
by the spinothalamic tract (STT). Near the thalamus, the STT bifurcates into the
neospinothalamic tract and the paleospinothalamic tract, projecting to the thalamus,
hypothalamus, and periaqueductal gray matter (PAG) in the brain stem. The thalamus processes
sensory input, which is projected to the cerebral cortex, basal ganglia, and limbic system. Descending
pathways conduct transmission from the brain to the spinal cord to control and modify afferent
sensory input.

[28] Nociception can be thought of as the detection of tissue damage by nociceptors.
Modulation of nociception occurs peripherally, spinally, and supraspinally. Tissue damage is
associated with the release of chemical mediators, such as serotonin, histamine, bradykinin,
cytokines, prostaglandins, and leukotrienes, which produce inflammation, and occurs in the
peripheral system. The pain transmission is modulated by these events and this lowers the
excitability threshold of the nociceptor so that normally non-painful stimuli
become painful. This is called nociceptor sensitization. Two other substances that sensitize
nociceptors are substance P and glutamate, which can be released from nerve terminals.

[29] The signals from the nociceptors are processed in the dorsal horn of the spine.
Repetitive, convergent input from A-delta and C-polymodal fibers at the dorsal horn can result
in a state where less stimulation is required for the generation of a pain response. This is known
as the wind-up phenomenon, and is thought to be initiated by the release of substance P and the
excitatory amino acids glutamate and aspartate.

[30] The brain also signals the spinal cord to modulate the pain response. The PAG
region of the brainstem contains high concentrations of opioid receptors, and sends projections
to the rostral medulla and eventually to the dorsal root, inhibiting ascending pain impulses.
Thus, the activation of the opioid receptors interrupts the transmission of the pain signal.
Descending pathways can also stimulate spinal nociceptive transmission.

b) Pain 

[31] Pain is typically classified into two categories: nociceptive pain (somatic pain)
and neuropathic pain. Nociceptive pain is pain that is sensed after some type of trauma. The
nociceptive pain is sensed by the "nociceptor" sensory fibers which are connected to the nervous
system. After an injury to a muscle, soft tissue (ligaments, tendons), bones, joints, or skin (or
other organs), these sensory fibers are stimulated, which causes a transmission of a signal
through an afferent neuron to the brain. Nociceptive pain is often characterized as a deep
aching, throbbing, gnawing, or sore sensation. Common examples of nociceptive pain include:
pain after trauma (e.g., a car accident or a fall), postoperative pain, and arthritis pain.
Nociceptive pain is usually localized and gets better with healing.

[32] Neuropathic pain is pain caused by damage to nerve tissue. Neuropathic pain is
often characterized as burning, severe shooting pains, and/or persistent numbness or tingling.
Common examples of neuropathic pain related to back pain include sciatica, pain that travels
from the spine down the arm, and pain that persists after back surgery.

[33] It is thought that in some cases prolonged nociceptive pain may progress to
neuropathic pain, and a patient may have both nociceptive and neuropathic pain at the same
time. Pain is also often classified as acute pain or chronic pain. Acute pain is characterized as
pain where the amount of pain directly correlates with the level and duration of tissue damage.
Acute pain therefore provides a protective reflex, such as the reflex to move your hand
immediately if you touch a sharp object. This type of pain is a symptom of injured or diseased
tissue, so that when the underlying problem is cured the pain goes away. Acute pain is a form
of nociceptive pain. Chronic pain, on the other hand, does not correlate with the severity of the
insult, and therefore typically will not serve a protective function. Prolonged damage to tissues,
e.g., knee pain or tooth ache, will eventually result in plastic (non-reversible) changes in the
neurons that process pain from that area, which now facilitate allodynia and/or
hyperalgesia. Chronic pain is born following these plastic neuronal changes, whereby the
neurons are now "sick" and pain will occur even in the absence of peripheral stimulus (e.g.,
amputated limbs, extracted teeth). In fact, its basis is now neuropathic, and neurons
continuously send pain messages to the brain even though there is no continuing tissue damage.
Neuropathic pain is a form of chronic pain.

(1) Anatomy of orofacial pain 

[34] The mandibular division of the trigeminal nerve provides sensory innervation to
the TMJ and masticatory muscles. The cell bodies of these primary sensory neurons are located
in the inferior portion of the trigeminal ganglion, extending their unmyelinated (C-fibers) or
thinly myelinated (Aδ-fibers) peripheral projections to structures of the face and jaws. More
specifically, nociceptive innervation to the temporomandibular joint (TMJ) is primarily
provided by the auriculotemporal nerve of the mandibular division of the trigeminal nerve
(Sessle BJ, Hu JW (1991). Can J Physiol Pharmacol 69: 617-626). Aδ and C nerve fibers,
whose cell bodies are located in the posterolateral part of the trigeminal ganglion (Yoshino K, et
al. (1998). Arch Oral Biol 43: 679-686), project distally and terminate as non-encapsulated free
nerve endings dispersed throughout the posterolateral part of the TMJ capsule (Bernick S
(1962). Oral Surg 15:488-492; Thilander B (1964). Acta Odont Scand 22:151-156; Frommer J,
Monroe CW (1966). J Dent Res 45:1762-1766; Klineberg I (1971). Ann Royal Coll Surg Engl
49:268-288), the posterior band of the meniscus and the posterior attachment (Dressen D, et al.
(1990). Acta Anat 139:154-160; Kido MA, et al. (1991). Arch Oral Biol 36:397-400; Kido MA,
et al. (1993). J Dent Res 72:592-598; Wink CS, et al. (1992). J Oral Maxillofac Surg
50:334-337). Inflammation, injury or other agents may cause excitation of the free and
unspecialized nerve endings of the unmyelinated C-fibers, which are predominately involved in
the transmission of nociception from the TMJ, muscles of mastication as well as the pulp of
teeth. The central projections enter the brain stem via the ventrolateral pons, descend caudally
as the dorsolateral trigeminal tract and synapse with second order sensory neurons at the
substantia gelatinosa of the subnucleus caudalis of the descending trigeminal nucleus (medullary
dorsal horn). Second order sensory neurons extend projections to the nucleus proprius,
followed by subsequent projections to the intermedial gray, and then to the reticular formation
of the brain stem, and through the intralaminar nuclei of the thalamus project widespread
connections into the cortex. The ascending sensory neural architecture is also susceptible to an
intrinsic opioid-releasing anti-nociceptive descending system, the inhibitory effects of which
are mediated by opioid receptors expressed in the presynaptic membrane of the primary sensory
neurons. Although pain is initially elicited at a peripheral site, it is further modulated centrally
(i.e., enhanced or attenuated in the brain), making this central
processing of pain a major component in sensory orofacial nociception.

[35] In the quest for developing new therapies for orofacial pain, gene therapy appears
to be an emerging treatment method (Kuboki T, et al. (1999). Arch Oral Biol 44: 701-709; Pohl
M, Braz J (2001). Eur J Pharmacol 429: 39-48; Baum BJ, et al. (2002). JADA 133: 35-44). For
example, it has been previously suggested that delivery of antisense oligonucleotides developed
against nociceptive genes to appropriate tissues may offer alternatives in designing novel
treatments for pain management (Wu CL, et al. (2001). Anesthesiology 95: 216-240).

[36] As disclosed herein, transfer of anti-nociceptive genes to sensory trigeminal neurons
innervating the orofacial region can be achieved after injection of lentiviral vectors at the
painful site, such as the TMJ, resulting in their uptake by free nerve endings and retrograde
transport to the sensory cells' nuclei. Previous studies demonstrated axonal retrograde transport
of horseradish peroxidase from the TMJ to the central nervous system (Romfh JH, et al. (1979).
Exp Neurol 65: 99-106; Capra NF (1987). Somatosensory Res 4: 201-213) including the
trigeminal ganglia (Yoshino K, et al. (1998). Arch Oral Biol 43: 679-686). In evaluating the
employment of lentiviral vectors as the basis for TMJ gene therapy, as an example, VSV-G
pseudotyped feline immunodeficiency viral vectors (FIV) were used. These vectors are capable
of stably transducing dividing, growth arrested as well as post-mitotic cells, as they are capable of
transgene integration into the host's genome (Poeschla EM, et al. (1998). Nature Med 4: 354-357).
VSV-G pseudotyping of viral vectors confers a broad range of host specificity, including
human and murine cells, as infection is mediated by the interaction of the viral envelope protein
and a phospholipid component of the cell membrane leading to membrane-fusion mediated
entry (Burns JC, et al. (1993). Proc Natl Acad Sci USA 90: 8033-8037; Carneiro FA, et al.
(2002). J Virol 76: 3756-3764). Therefore, FIV vectors can mediate sustained gene expression
in non-dividing terminally differentiated trigeminal sensory neurons, a property unique to
lentiviral vectors.

c) Current pharmacologic agents in the management of pain 

[37] Non-steroidal anti-inflammatory drugs (NSAIDs) are often utilized as the first
line of agents for the management of pain. NSAIDs primarily exert their pain-killing effects by
inhibiting the production of prostanoids and attenuating peripheral inflammatory conditions that



wo 2004/073646 



PCT/US2004/004914 



may be responsible for pain elicitation. Alternatively, corticosteroids may be utilized with
peripheral routes of action. In contrast, exogenously administered opioid drugs (e.g., morphine)
mimic the effects of the endogenous opioids by crossing the blood brain barrier (BBB).
Similarly, tricyclic antidepressants that cross the BBB have also been employed in cases of
chronic pain by inhibiting the reuptake of serotonin and norepinephrine. However, each of these
four classes of drugs is characterized by significant side effects that prohibit long term use
and often shows unfavorable treatment outcomes.

d) Opioid receptors and mechanism of action 

[38] Opioid analgesics have been used for pain management for hundreds of years.
Opium itself consists of the dried latex from the unripe fruit of the opium poppy Papaver
somniferum. Morphine is isolated from opium. Opioid receptors exist in the spinal and
supraspinal regions of the nervous system. Opioids can modulate neuronal transmission by
binding to opioid receptors in the dorsal-root ganglia, the central terminals of primary afferent
neurons (LaMotte C, et al., Brain Res 1976;112:407-12; Fields HL, et al., Nature
1980;284:351-3) and peripheral sensory-nerve fibers and their terminals (Stein C, et al., Proc
Natl Acad Sci USA 1990;87:5935-9; Hassan AHS, et al., Neuroscience 1993;55:185-95).
The dorsal-root ganglia and trigeminal ganglion (Xie GX, et al., Life Sciences 1999;64:2029-37;
Li JL, et al., Brain Res 1998;794:347-52) contain messenger RNA (mRNA) for opioid
receptors (Schafer M, et al., Eur J Pharmacol 1995;279:165-9; Mansour A, et al., Brain Res
1994;643:245-65) and primary afferent nerves mediate the peripheral antinociceptive effects of
morphine (Bartho L, et al., Naunyn Schmiedebergs Arch Pharmacol 1990;342:666-70).
Endogenous opioids and their receptors are able to produce potent anti-nociception.
Opioids increase potassium currents and decrease calcium currents in the cell
bodies of sensory neurons (Werz MA, Macdonald RL., Neurosci Lett 1983;42:173-8; Schroeder
JE, et al., Neuron 1991;6:13-20), both of which can lead to the inhibition of neuronal firing and
transmitter release. A similar process occurring throughout the neuron can explain why opioids
attenuate both the excitability of the peripheral nociceptive terminals and the propagation of
action potentials (Andreev N, et al., Neuroscience 1994;58:793-8; Russell NJW, et al., Neurosci
Lett 1987;76:107-12). At central nerve terminals (Yaksh TL, et al., Nature 1980;286:155-7)
and at peripheral sensory-nerve endings (Yaksh TL., Brain Res 1988;458:319-24), opioids inhibit
the calcium-dependent release of excitatory, pro-inflammatory compounds (e.g., substance P),
which contributes to the anti-inflammatory actions of opioids (Barber A, Gottschlich R. et al.,
Med Res Rev 1992;12:525-62).

[39] There are three known opioid receptors: μ-, κ-, and δ-opioid receptors.
μ-Receptors are further subdivided into at least two subclasses, μ1- and μ2-receptors. The body
produces opioid-like molecules, called endogenous opioids, such as endorphins, enkephalins,
and dynorphins. μ-Receptors are known to mediate analgesia, respiratory depression,
bradycardia, nausea/vomiting, and decreased gastrointestinal tone.

[40] δ-Receptors mediate supraspinal analgesia, and κ-receptors mediate dysphoria and
tachycardia. As pain impulses ascend through the spinal and supraspinal nervous system,
activation of the opioid receptors inhibits these impulses and inhibits the transmission of pain
from the dorsal horn. In addition, opioid analgesics prevent the presynaptic release of pain
mediators such as Substance P into the spinal cord region.

[41] All animals, such as mammals, such as humans, contain opioid receptors. It is
understood that there are homologs for the various opioid receptors across animal species. A
number of different opioid receptor sequences are set forth in the SEQ IDs, including μ-opioid
receptors. The human μ-opioid receptor is referred to herein as HUMOR. It is understood that
if a particular statement or reference is made regarding HUMOR, this statement is equally
applicable to homologous receptors, unless specifically indicated otherwise.

[42] Opioid analgesics are typically grouped into three classes: naturally occurring
(morphine, codeine); semi-synthetic morphine derivatives (hydromorphone, oxycodone,
hydrocodone); and synthetic. Synthetic opioid analgesics include morphinan derivatives
(levorphanol); methadone derivatives (methadone, propoxyphene); benzomorphan derivatives
(pentazocine); and phenylpiperidine derivatives (meperidine, fentanyl, sufentanil, alfentanil,
remifentanil). Tramadol is an opioid analgesic that also inhibits the reuptake of
norepinephrine and serotonin.

[43] Another way to classify opioid analgesics is as agonists, partial agonists, mixed
agonists/antagonists, and antagonists, based on their interactions at the opioid receptors. μ- and
κ-opioid receptors are stimulated by agonists. Partial agonists have reduced μ-opioid receptor
activity, and mixed agonists/antagonists only stimulate certain μ- and κ-opioid receptors.
Antagonists bind μ- and κ-opioid receptors but do not stimulate the receptor activity.

[44] Some agonists are Morphine, Hydromorphone, Oxymorphone, Codeine,
Oxycodone, Hydrocodone, Dihydrocodeine, Methadone, Meperidine, Fentanyl, Sufentanil,
Alfentanil, and Remifentanil. An example of a partial agonist is Buprenorphine. Pentazocine,
Nalbuphine, and Butorphanol are examples of mixed agonists/antagonists. Examples of
antagonists are Naloxone and Nalmefene. It is understood that one way to classify opioid
receptors is by which molecules act as antagonists and which act as agonists, for example.
Thus, a receptor can be defined as "a receptor for which morphine is an agonist".
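
The agonist/antagonist grouping described above can be illustrated as a simple lookup. The following Python sketch is illustrative only: the names `OPIOID_CLASSES` and `classify` are hypothetical and introduced here for the example, and the drug lists are copied from the preceding paragraph rather than from any pharmacological database.

```python
# Hypothetical lookup table; every entry is taken from the text above.
OPIOID_CLASSES = {
    "agonist": [
        "morphine", "hydromorphone", "oxymorphone", "codeine", "oxycodone",
        "hydrocodone", "dihydrocodeine", "methadone", "meperidine",
        "fentanyl", "sufentanil", "alfentanil", "remifentanil",
    ],
    "partial agonist": ["buprenorphine"],
    "mixed agonist/antagonist": ["pentazocine", "nalbuphine", "butorphanol"],
    "antagonist": ["naloxone", "nalmefene"],
}


def classify(drug: str) -> str:
    """Return the class listed for a drug name, or 'unknown' if it is not listed."""
    name = drug.strip().lower()
    for drug_class, members in OPIOID_CLASSES.items():
        if name in members:
            return drug_class
    return "unknown"
```

A call such as `classify("Naloxone")` looks the name up case-insensitively; any compound not named in the text falls through to `"unknown"` rather than being guessed.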

[45] There are a number of adverse side effects that can occur when taking opioid
analgesics, such as CNS effects, such as sedation, confusion, and respiratory depression.
Gastrointestinal adverse effects include nausea, vomiting, and constipation. Involuntary
muscular contractions (twitching) known as myoclonus, bradycardia, and hypotension can also
occur. Lastly, physical and psychological dependence can also occur upon use or prolonged use
of opioid analgesics. Thus there is a significant need for the disclosed compositions and
methods, which reduce or eliminate the need for opioid analgesics in many indications.

e) μ-opioid receptor therapy

[46] The disclosed approach for the management of pain involves the targeted
expression of opioid receptor(s), such as the μ-opioid receptor, in the primary neurons
innervating the areas experiencing pain, such as orofacial areas, so that these same neurons
become more susceptible to the intrinsic opioid anti-nociceptive mechanisms. Disclosed are
compositions and methods for treating pain. The compositions comprise an opioid receptor that
is expressed from a vector. Typically these compositions will be delivered at the point of
pain. It is thought that their expression increases the efficiency of the body's own opioid-like
molecules and decreases pain.

[47] Disclosed herein, the cDNA for a human μ-opioid receptor (HUMOR) is set
forth in SEQ ID NO:2. The μ-opioid receptor (Raynor K, et al., J Pharmacol Exp Ther.
1995;272:423-8) has been placed into a vector herein and expressed in primary fibroblasts as well as
cells of the N2a neuronal cell line (Figure 1). Transduction and stable expression of μ-opioid
receptor in neurons can be accomplished by employing VSV-G pseudotyped feline immunodeficiency
viral vectors (FIV).

[48] The expression of the μ-opioid receptor in the neurons at the point of pain in
certain embodiments requires transduction in a non-dividing cell such as a neuron. This can be
accomplished using a transduction mechanism, such as lipofection or encapsulation methods, or
via viral vector systems that function without cell division, such as lentiviruses, such as the FIV
virus, or adeno-associated viruses, rAAV vectors, HSV Amplicon, and liposomes.

[49] It has been previously shown that this FIV system is capable, due to its lentiviral
properties, of infecting terminally differentiated cells, including neurons. Disclosed are
methods for administering vectors, such as the FIV(μ-opioid receptor) vector, peripherally at the
site of pain. The neurons innervating that specific site and mediating the nociceptive signals are
infected and stably transduced. These vectors, including vectors expressing lacZ and the
μ-opioid receptor, can transduce nerve cells in vivo, in mice, through injection at the periphery.

[50] Disclosed herein is the stable expression of a reporter gene, the lacZ gene, in
neurons located in the appropriate region of the trigeminal ganglion following peripheral
injection of FIV(lacZ) in the area of the TMJ (Figure 2), as well as a variety of expression
vectors containing the μ-opioid receptor, such as the human μ-opioid receptor.

[51] Disclosed are vectors wherein the vector includes sequence encoding the
μ-opioid receptor gene. Also disclosed are vectors wherein a μ-opioid receptor gene has been
cloned in an FIV vector. Disclosed are methods comprising administering the disclosed vectors
to cells, including cells involved in transmitting pain signals, such as nerve cells in the orofacial
regions, related to, for example, pain from the TMJ and the masseter muscle.

[52] Also disclosed are transgenic mice that have been stably transfected with the
disclosed vectors. These mice can be used, for example, as models of pain and for the testing of
therapeutics.

f) Mouse model of experimental nociception 

[53] The majority of models evaluate reflex increases in jaw muscle activity,
activating putative nociceptive pathways by the injection of algesic substances. Broton and
Sessle (Broton JG, Sessle BJ, et al., Arch Oral Biol 1988;33:741-47) placed hypertonic saline,
potassium chloride and histamine into the TMJ of cats and found an increase in electromyographic
(EMG) activity in the temporal, digastric and genioglossus muscles. This suggests that the
products of tissue injury and/or inflammation are capable of producing pain within the TMJ,
causing excitation of sensory afferents and reflex muscle activity. Tambeli et al. (Tambeli CH,
et al., J Dent Res 1997;76 (Special Issue): abstr# 1263) injected mustard oil unilaterally into the
TMJ and the results were similar to those of Broton and Sessle (Broton JG, Sessle BJ, et al., Arch
Oral Biol 1988;33:741-47). Increases in masticatory muscle activity have also been demonstrated by
injection of excitatory amino acids into the TMJ area. Cairns et al. (Cairns BE, et al., J
Neurosci 1998;18:8056-64; Cairns BE, et al., J Neurophysiol 1999;81:1966-69) injected
glutamate into the TMJ area of rats and, in a dose-dependent manner, induced a prolonged
increase in EMG activity by the excitation of nociceptors (A delta and C fibers) (Cairns BE, et
al., J Neurophysiol 2001;86:782-90). Cairns et al. (Cairns BE, et al., J Neurophysiol
2001;85:2446-54) placed glutamate into the masseter muscle of the rat and human and evoked
higher masseter muscle activity in females than in males of both species. These results indicate that
peripheral insult can produce pain behavior and changes in resting muscle activity when
algogenic substances are injected into the TMJ and masticatory muscles. Disclosed are
behavioral models to assess orofacial pain from the TMJ and masseter muscle via application of
glutamate into the TMJ and masticatory muscles. These models can include mice, disclosed
herein, which have had vectors encoding μ-opioid receptor stably transduced. Resistance to jaw
opening and EMG activity can serve as behavioral measures of pain.

C. Compositions

1. Opioid receptors

[54] There are typically considered three classes of opioid receptor: μ, δ and κ. Genes
encoding for these receptors have been cloned (Evans et al (1992) Science 258 1952; Kieffer et
al (1992) Proc.Natl.Acad.Sci.USA 89 12048; Chen et al (1993) Mol.Pharmacol. 44 8; and
Minami et al (1993) FEBS Lett. 329 291, all of which are herein incorporated by reference for
material related to opioid receptors and their sequence). In addition, an orphan receptor was
identified which has a high degree of homology to the known opioid receptors, and based on
structural grounds it is considered a receptor called ORL1 (opioid receptor-like) (Mollereau et al
(1994) FEBS Lett. 341 33, herein incorporated by reference for material related to opioid
receptors and their sequence). Since the cloned receptors function as opioid receptors, for
example by interacting with pertussis toxin-sensitive G-proteins, all of the cloned opioid receptors
possess the same general structure, which includes an extracellular N-terminal region, seven
transmembrane domains and an intracellular C-terminal tail structure. Evidence obtained from
pharmacokinetic and activity data indicates there are subtypes of each receptor and other types,
such as less well-characterized opioid receptors, such as ε, λ, ι, ζ, which are known. One way
of characterizing the different receptor subtypes for δ- and κ-receptors is through different
post-translational modifications of the gene product (glycosylation, palmitoylation,
phosphorylation, etc). Also, receptor dimerization to form homomeric and heteromeric
complexes, or interaction of the gene product with associated proteins such as RAMPs, can
affect function, and thus represents another way to characterize the receptors. Different opioids
have different affinity for the different opioid receptors. For example, morphine (μ),
leu-enkephalin and met-enkephalin (δ), dynorphin (κ), and β-endorphin have different affinities
for the various opioid receptors.

a) μ-Receptor subtypes

[55] The MOR-1 gene, encoding for one form of the μ-receptor, shows approximately
50-70% homology to the genes encoding for the δ- (DOR-1), κ- (KOR-1) and orphan (ORL1)
receptors. Two different splice variants of the MOR-1 gene have been cloned, and they differ
by 8 amino acids in the C-terminal tail which are either present or not. The splice variants
exhibit differences in their rate of onset and recovery from agonist-induced internalization but
their pharmacology does not appear to differ in ligand binding assays. A MOR-1 knockout
mouse has been made, and in the absence of the MOR-1 gene morphine fails to alleviate pain,
to exhibit positive reinforcing properties, or to induce physical dependence. This indicates
that, at least in this species, morphine's analgesia is not mediated through δ- or κ-receptors
(Matthes et al (1996) Nature 383 818).

[56] The μ receptor is divided into the μ1 and μ2 groups. The division occurs because
of binding and pharmacological activity studies which indicate, for example, that naloxazone and
naloxonazine abolish the binding of radioligands to the μ1-site, and in vivo studies showed that
naloxazone selectively blocked morphine-induced antinociception but did not block morphine-
induced respiratory depression or the induction of morphine dependence, indicating different
types of μ-receptor (Ling et al (1984) Science 226 462 and Ling et al (1985)
J.Pharmacol.Exp.Ther. 232 149). Subsequent work in other laboratories has failed to confirm
this classification.

[57] Peptide sequences of the human and mouse μ receptor are set forth in SEQ ID
NOs 1 and 3 respectively.

[58] There is also data consistent with a third form of receptor where analogues of
morphine with substitutions at the 6 position (e.g. morphine-6β-glucuronide, heroin and 6-
acetyl morphine) are agonists, but with which morphine itself does not interact (Rossi et al
(1996) Neuroscience Letters 216 1, herein incorporated by reference for material at least related
to opioid receptors and their function and structure). Antinociception tests on mice show that
morphine does not exhibit cross tolerance with morphine-6β-glucuronide, heroin or 6-acetyl
morphine. Furthermore, in mice of the CXBK strain morphine is a poor antinociceptive agent
whereas morphine-6β-glucuronide, heroin and 6-acetyl morphine are all potently
antinociceptive. Heroin and morphine-6-glucuronide, but not morphine, still produce
antinociception in MOR-1 knockout mice in which the disruption in the MOR-1 gene was
engineered in exon-1 (Schuller et al (1999) Nature Neuroscience 2 151). Furthermore, all three
agonists were ineffective as antinociceptive agents in MOR-1 knockout mice in which exon-2,
not exon-1, had been disrupted. This indicates that the antinociceptive actions of heroin and
morphine-6-glucuronide in the exon-1 MOR-1 mutant mice are mediated through a receptor
produced from an alternative transcript of the MOR-1 gene, differing from the MOR-1 gene
product, the μ-opioid receptor, in the exon-1 region.

b) δ-Receptor subtypes

[59] Only one δ-receptor gene has been cloned (DOR-1), but overlapping
subdivisions of δ-receptor have been proposed (δ1/δ2 and δcx/δncx) on the basis of in vivo and
in vitro pharmacological experiments.

[60] The δ receptor subclasses arise from pharmacological studies. Results from in
vivo rodent studies are shown in Table 1.



[61] Table 1.

Subtype   Agonist                  Competitive antagonist           Non-competitive antagonist

δ1        DPDPE / DADLE            BNTX (7-benzylidenenaltrexone)   DALCE ([D-Ala2, D-Leu5]enkephalyl-Cys)
δ2        Deltorphin II / DSLET    Naltriben                        5'-NTII (naltrindole 5'-isothiocyanate)

[62] There are a number of different ligands for the opioid receptors which 
differentially bind one or more receptors. Examples of these ligands are shown in Table 2. 



Receptor type          μ-Receptor      δ-Receptor               κ-Receptor            ORL1

Selective agonists     endomorphin-1   [D-Ala2]-deltorphin I    enadoline             nociceptin / OFQ
                       endomorphin-2   [D-Ala2]-deltorphin II   U-50488               Ac-RYYRWK-NH2*
                       DAMGO           SNC80                    U-69593

Selective antagonists  CTAP            naltrindole              nor-binaltorphimine
                                       TIPP
                                       ICI 174864

Radioligands           [3H]-DAMGO      [3H]-…                   [3H]-enadoline        nociceptin
                                                                [3H]-U69593

[63] Table 2



[64] The pharmacological properties of the cloned DOR-1 receptor are somewhere
between those predicted for either the δ1 or δ2 subtypes. Mouse and human recombinant
receptors both bind DPDPE and deltorphin II, which can displace [3H]-diprenorphine. This
is different than either a δ1 or δ2 classification (Law et al (1994) J.Pharmacol.Exp.Ther. 271
1686). [3H]-diprenorphine binding to the mouse recombinant receptor, however, is more highly
displaced by naltriben than BNTX, consistent with it being δ2-like.

[65] Opioid receptors have also been indicated to be in complex with μ-receptors and
κ-receptors. For example, one type of δ receptor subtype appears to complex (δcx), and another
appears not to complex (δncx) (Rothman et al (1993) In: Handbook Exp.Pharmacol. Ed. A. Herz
104/1 p217).

c) κ-Receptor

[66] The cloned κ-receptor has the sequence set forth in SEQ ID NO: 5, which
represents an example of a κ-receptor.

d) The orphan opioid receptor

[67] The orphan receptor has been identified in three species: rat, mouse and man, all
having a greater than 90% identity with each other. This receptor is typically referred to as
ORL1, for opioid receptor-like 1. The endogenous peptide agonist for ORL1 is known as
nociceptin or orphanin FQ. While the ORL1 receptor has structural homology to opioid
receptors, it does not have pharmacological homology. Non-selective ligands that exhibit high
affinity for all μ-, κ- and δ-receptors have very low affinity for the ORL1 receptor. Comparison
of the deduced amino-acid sequences of the four receptors highlights structural differences
consistent with the lack of co-ligand binding. The trans-membrane regions are conserved near
their top in the μ-, κ- and δ-receptors, but are altered in ORL1. Site-directed mutants of ORL1
(rat) that move it towards the traditional receptors are able to bind the traditional receptors'
ligands. For example, the benzomorphan bremazocine binds ORL1 after changing Ala213 in TM5
to the conserved Lys of μ, κ and δ, or after changing the Val-Gln-Val276-278 sequence of TM6 to
the conserved Ile-His-Ile motif (Meng et al (1996) J.Biol.Chem. 271 32016). There are also a
number of splice variants of the ORL1 receptor: XOR (Wang et al (1994) FEBS Lett. 348 75),
with a long form (XOR1L) containing an additional 28 amino acids in the third extracellular
loop, and, in the homologous receptor from mouse, KOR-3, five splice variants have been
reported to date (Pan et al (1998) FEBS Lett. 435 65).

e) Endogenous Ligands 

[68] In mammals the endogenous opioid peptides are mainly derived from four
precursors: pro-opiomelanocortin, pro-enkephalin, pro-dynorphin and pro-nociceptin/orphanin
FQ. Nociceptin/orphanin FQ is processed from pro-nociceptin/orphanin FQ and is the
endogenous ligand for the ORL1-receptor; it has little affinity for the μ-, δ- and κ-receptors.
Table 3 sets forth endogenous ligands for the opioid receptors. These peptides bind μ-, δ- and
κ-receptors with different affinity, and have negligible affinity for ORL1-receptors, but none binds
exclusively to one opioid receptor type. β-Endorphin is equiactive at μ- and δ-receptors with
much lower affinity for κ-receptors; the post-translational product, N-acetyl-β-endorphin, has
very low affinity for any of the opioid receptors. [Met]- and [Leu]enkephalin have high
affinities for δ-receptors, ten-fold lower affinities for μ-receptors and negligible affinity for
κ-receptors. Other products of processing of pro-enkephalin, which are N-terminal extensions of
[Met]enkephalin, have a decreased preference for the δ-receptor, with some products, e.g.
metorphamide, displaying highest affinity for the μ-receptor. The opioid fragments of pro-
dynorphin, particularly dynorphin A and dynorphin B, have high affinity for κ-receptors but also
have significant affinity for μ- and δ-receptors.

[69] Endomorphin-1 and endomorphin-2 are putative products of an as yet
unidentified precursor, and have been proposed to be the endogenous ligands for the μ-receptor,
where they are highly selective. The endomorphins are amidated tetrapeptides and are
structurally unrelated to the other endogenous opioid peptides (Table 3). Although the study of
the cellular localization of these peptides is at an early stage, endomorphin-2 is found in discrete
regions of rat brain, some of which are known to contain high concentrations of μ-receptors.
Endomorphin-2 is also present in primary sensory neurones and the dorsal horn of the spinal
cord, where it could function to modulate nociceptive input.

[70] In comparison to the mainly non-selective mammalian opioid peptides (the
exceptions being the endomorphins), amphibian skin contains two families of D-amino acid-
containing peptides that are selective for μ- or δ-receptors. Dermorphin is a μ-selective
heptapeptide, Tyr-D-Ala-Phe-Gly-Tyr-Pro-Ser-NH2, without significant affinity at δ- and
κ-receptors. In contrast, the deltorphins - deltorphin (dermenkephalin; Tyr-D-Met-Phe-His-Leu-
Met-Asp-NH2), [D-Ala2]-deltorphin I and [D-Ala2]-deltorphin II (Tyr-D-Ala-Phe-Xaa-Val-
Val-Gly-NH2, where Xaa is Asp or Glu respectively) - are highly selective for δ-opioid
receptors. Table 3 shows a variety of endogenous opioid receptor ligands.
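
The two [D-Ala2]-deltorphins above share one template and differ only at the variable position Xaa. As a minimal sketch of that relationship, the Python snippet below fills the template with Asp or Glu; the names `DELTORPHIN_TEMPLATE` and `deltorphin` are hypothetical and exist only for this illustration.

```python
# Template quoted from the text: Tyr-D-Ala-Phe-Xaa-Val-Val-Gly-NH2.
DELTORPHIN_TEMPLATE = ["Tyr", "D-Ala", "Phe", "Xaa", "Val", "Val", "Gly"]

# Per the text, Xaa is Asp for [D-Ala2]-deltorphin I and Glu for II.
XAA_BY_VARIANT = {"I": "Asp", "II": "Glu"}


def deltorphin(variant: str) -> str:
    """Return the three-letter sequence of [D-Ala2]-deltorphin I or II."""
    xaa = XAA_BY_VARIANT[variant]  # raises KeyError for any other variant
    residues = [xaa if r == "Xaa" else r for r in DELTORPHIN_TEMPLATE]
    return "-".join(residues) + "-NH2"
```

Only the fourth residue changes between the two variants; every other position, including the D-alanine at position 2 and the C-terminal amide, is shared.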



[71] Table 3.

Precursor              Endogenous peptide   Amino acid sequence

Pro-opiomelanocortin   β-Endorphin          YGGFMTSEKSQTPLVTL-
                                            KKNAHKNAYKKGE

Pro-enkephalin         [Met]enkephalin      YGGFM
                       [Leu]enkephalin      YGGFL
                                            YGGFMRF
                                            YGGFMRGL
                       Metorphamide         YGGFMRRV-NH2

Pro-dynorphin          Dynorphin A          YGGFLRRIRPKLKWDNQ
                       Dynorphin A(1-8)     YGGFLRRI
                       Dynorphin B          YGGFLRRQFKNATT
                       α-neoendorphin       YGGFLRKYPK
                       β-neoendorphin       YGGFLRKYP

Pro-nociceptin / OFQ   Nociceptin           FGGFTGARKSARKLANQ

Pro-endomorphin*       Endomorphin-1        YPWF-NH2
                       Endomorphin-2        YPFF-NH2
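
The statement that the endomorphins are structurally unrelated to the other peptides in Table 3 can be seen by comparing N-terminal residues. The Python sketch below is an illustration only: it copies a subset of the sequences from the table (one-letter code, C-terminal amidation omitted) and checks which begin with Tyr-Gly-Gly-Phe (YGGF). The names `PEPTIDES`, `SHARED_N_TERMINUS`, and `group_by_n_terminus` are hypothetical.

```python
# Subset of Table 3, one-letter amino acid code; "-NH2" is omitted.
PEPTIDES = {
    "[Met]enkephalin": "YGGFM",
    "[Leu]enkephalin": "YGGFL",
    "Metorphamide": "YGGFMRRV",
    "Dynorphin A": "YGGFLRRIRPKLKWDNQ",
    "Dynorphin A(1-8)": "YGGFLRRI",
    "Nociceptin": "FGGFTGARKSARKLANQ",
    "Endomorphin-1": "YPWF",
    "Endomorphin-2": "YPFF",
}

SHARED_N_TERMINUS = "YGGF"  # Tyr-Gly-Gly-Phe


def group_by_n_terminus(peptides):
    """Split peptide names into those that start with YGGF and those that do not."""
    shared = sorted(n for n, s in peptides.items() if s.startswith(SHARED_N_TERMINUS))
    other = sorted(n for n, s in peptides.items() if not s.startswith(SHARED_N_TERMINUS))
    return shared, other
```

With the entries above, the enkephalin and dynorphin family members fall into the first group, while nociceptin (which starts with Phe rather than Tyr) and both endomorphins fall into the second.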



[72] Opioid receptor activation produces a wide array of cellular responses (Table 4).
For example, there are direct G-protein βγ or α subunit-mediated effects, such as activation of
an inwardly rectifying potassium channel, inhibition of voltage operated calcium channels (N, P,
Q and R type), and inhibition of adenylyl cyclase. There are responses of unknown intermediate
mechanism, such as activation of PLA2, activation of PLCβ (possibly direct G protein βγ subunit
activation), activation of MAPKinase, activation of large conductance calcium-activated potassium
channels, activation of L type voltage operated calcium channels, inhibition of T type voltage
operated calcium channels, and direct inhibition of transmitter exocytosis. There are also
responses in other effector pathways, such as activation of voltage-sensitive potassium channels
(activation of PLA2), inhibition of M channels (activation of PLA2), inhibition of the
hyperpolarisation-activated cation channel (Ih) (reduction in cAMP levels following inhibition
of adenylyl cyclase), elevation of intracellular free calcium levels (activation of PLCβ,
activation of L type voltage operated calcium conductance), potentiation of NMDA currents
(activation of protein kinase C), inhibition of transmitter release (inhibition of adenylyl cyclase,
activation of potassium channels and inhibition of voltage operated calcium channels),
decreases in neuronal excitability (activation of potassium channels), increases in neuronal
firing rate (inhibition of inhibitory transmitter release - disinhibition), and changes in gene
expression (long-term changes in adenylyl cyclase activity, elevation of intracellular calcium
levels, activation of cAMP response element binding protein (CREB)).

2. Compositions for treating pain 

[73] Disclosed are constructs and vectors for expressing one or more opioid receptors
in a cell, such as a nerve cell, such as a peripheral nerve cell. As discussed herein, opioid
receptors are typically expressed in the spinal or supraspinal nerve cells, and nerve cells in the
periphery typically do not express these receptors. The disclosed compositions and methods are
designed to express the opioid receptors in nerve cells which are damaged or transmitting because
of trauma, but which have no endogenous opioid receptors, or insufficient numbers of
endogenous receptors, to react to the endogenous opioid-like molecules, typically in the
periphery of the nerve cell. Thus, the expression of the opioid receptors in the nerve cell near the
point of pain will typically increase the amount of opioid receptors in this area and thus
increase the responsiveness to endogenous opioid-like molecules. By expression of the opioid
receptors, the sensation of pain can be reduced, not by administration of opioid analgesics, but
rather by more efficient use of endogenous opioid-like compounds. It is understood, however,
that opioids, opioid-like molecules, and/or other pain alleviating molecules can be added in
addition to the disclosed opioid receptors.

[74] Disclosed are methods wherein administration occurs in the intra-articular region
of the jaw. The results shown herein demonstrate that intra-articular injection of FIV(lacZ)
resulted in successful gene transfer to articular TMJ surfaces as well as the joint meniscus.
Thus, disclosed are methods wherein the administration of the disclosed vectors results in
delivery to the articular TMJ surfaces and the joint meniscus.

[75] Nociceptive innervation to the temporomandibular joint (TMJ) is primarily
provided by the auriculotemporal nerve of the mandibular division of the trigeminal nerve
(Sessle & Wu, 1991). Aδ and C nerve fibers, whose cell bodies are located in the posterolateral
part of the trigeminal ganglion (Yoshino et al., 1998), project distally and terminate as
non-encapsulated free nerve endings dispersed throughout the posterolateral part of the TMJ capsule
(Bernick, 1962; Thilander, 1964; Frommer & Monroe, 1966; Klineberg, 1971), the posterior
band of the meniscus and the posterior attachment (Dressen et al., 1990; Kido et al., 1991, 1993;
Wink et al., 1992). Transfer of anti-nociceptive genes to sensory trigeminal neurons innervating
the orofacial region can be achieved after injection of lentiviral vectors at the painful site, such
as the TMJ, resulting in their uptake by free nerve endings and retrograde transport to the
sensory cells' nuclei. Previous studies demonstrated axonal retrograde transport of horseradish
peroxidase from the TMJ to the central nervous system (Romfh et al., 1979; Carpa, 1987),
including the trigeminal ganglia (Yoshino et al., 1998).

[76] Disclosed are constructs capable of expressing any of the opioid receptor gene
products.

[77] Disclosed are constructs capable of expressing opioid receptors, such as the
μ-opioid receptor gene product. The μ-opioid receptor construct allows for synthesis of μ-opioid
receptor protein.

[78] The μ-opioid receptor construct typically comprises three parts: 1) a promoter, 2)
the μ-opioid receptor coding sequence, and 3) a polyA tail. The polyA tail can be from the
bovine growth hormone or any polyA tail, including synthetic polyA tails. The bovine growth
hormone polyA tail carries elements that not only increase expression, but also increase
stability of any gene construct. These three parts can be integrated into any vector delivery
system which is capable of transducing terminally differentiated cells, such as nerve cells.
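
The three-part layout described above (promoter, coding sequence, polyA tail) can be sketched as a small data structure. This Python example is purely illustrative: the class name `ExpressionCassette`, its field names, and the label strings are hypothetical, and the fields hold descriptive labels rather than actual nucleotide sequences.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ExpressionCassette:
    """Hypothetical container for the three parts named in the text."""
    promoter: str          # e.g. a nerve cell specific promoter
    coding_sequence: str   # the mu-opioid receptor coding sequence
    polya_signal: str      # e.g. the bovine growth hormone polyA tail

    def parts_in_order(self):
        """Return the parts in 5' to 3' order: promoter, coding sequence, polyA."""
        return [self.promoter, self.coding_sequence, self.polya_signal]

    def describe(self) -> str:
        """Join the part labels into a one-line summary of the cassette."""
        return " - ".join(self.parts_in_order())


# Example labels only; the neuron specific enolase (NSE) promoter and the
# bovine growth hormone (bGH) polyA tail are the options named in the text.
cassette = ExpressionCassette(
    promoter="NSE promoter",
    coding_sequence="mu-opioid receptor CDS",
    polya_signal="bGH polyA",
)
```

Keeping the three parts as separate fields mirrors the text's point that the same cassette can be carried by any vector delivery system capable of transducing terminally differentiated cells.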

[79] The promoter can be any promoter, such as those discussed herein. It is
understood, as discussed herein, that there are functional variants of opioid receptors, such as the
μ-opioid receptor protein, which can be made. In certain embodiments the promoter is going to
be a cell specific promoter, such as a nerve cell specific promoter, such as the neuron specific
enolase promoter. Other promoters are disclosed herein.

[80] The promoter can be any promoter, such as those discussed herein. It is
understood, as discussed herein, that there are functional variants of opioid receptors, such as the
μ-opioid receptor protein, which can be made. In certain embodiments the promoter is going to
be a cell specific promoter, such as a nerve cell specific promoter, such as the neuron specific
enolase promoter.

[81] μ-opioid receptor cDNA can be obtained from the American Type Culture
Collection. (American Type Culture Collection, Manassas, VA 20110-2209; μ-opioid receptor
ATCC#. Raynor K, et al., Characterization of the cloned human mu opioid receptor. J
Pharmacol Exp Ther. 1995;272:423-8.)

[82] Also disclosed are constructs encoding for the human or mouse μ-opioid
receptor, as well as the β-galactosidase reporter gene (lacZ).

[83] Disclosed are nucleic acids comprising sequence encoding μ-opioid receptor.
Also disclosed are nucleic acids, wherein the nucleic acid further comprises a promoter
sequence, wherein the μ-opioid receptor has at least 80% identity to the sequence set forth in
SEQ ID NO:2 or 4, wherein the μ-opioid receptor has at least 85% identity to the sequence set
forth in SEQ ID NO:1 or 3, wherein the μ-opioid receptor has at least 90% identity to the
sequence set forth in SEQ ID NO:1 or 3, wherein the μ-opioid receptor has at least 95% identity
to the sequence set forth in SEQ ID NO:1 or 3, and/or wherein the μ-opioid receptor has the
sequence set forth in SEQ ID NO:1 or 3.
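
Percent identity can be computed in more than one way, and this section does not prescribe a method. As a minimal sketch, assuming the two sequences have already been aligned to equal length with `-` marking gaps, the hypothetical helpers `percent_identity` and `meets_threshold` below count identical aligned positions and compare the result against a threshold such as 80, 85, 90, or 95. The toy strings in the usage note are short illustrative peptides, not the sequences of the SEQ ID NOs.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns holding the same residue in both sequences.

    Assumes the inputs are pre-aligned: equal length, '-' for gaps.
    Columns where both sequences have a gap are ignored; a column where
    only one sequence has a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float) -> bool:
    """True if the percent identity is at least the given threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For instance, `percent_identity("YGGFMRRV", "YGGFLRRI")` compares eight columns, six of which match, so that pair would not satisfy an 80% threshold. Other conventions (for example, dividing by the length of the shorter sequence) give different values, so the denominator used here is one assumption among several.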

[84] Also disclosed are vectors comprising the disclosed nucleic acids. Also 
disclosed are cells comprising the disclosed nucleic acids and vectors. Any cell can be targeted 
20 with the disclosed constructs. However, nerve cells, for example, are terminally differentiated. 
This means that they are no longer dividing. The state of a mature non-dividing nerve cell can 
define terminally differentiated cells. In terms of differentiated\stable transduction, nerve cells 
thus represent attractive targets because once DNA is integrated, there is a very low probal^ility 
that it will not remain in the ceD. 

25 [85] Also disclosed are non-human mammals comprising the disclosed nucleic acids, 

vectors, and cells disclosed herein. 

[86] Also disclosed are methods of providing μ-opioid receptor in a cell comprising
transfecting the cell with the nucleic acids.

[87] Also disclosed are methods of delivering the disclosed compositions, wherein the
transfection occurs in vitro or in vivo.

[88] Disclosed are methods of making a transgenic organism comprising
administering the disclosed nucleic acids, vectors and/or cells.

[89] Disclosed are methods of making a transgenic organism comprising transfecting
a lentiviral vector to the organism during a perinatal stage of the organism's development.
Strategies of producing genetically engineered pluripotent, such as embryonic, stem cells can
be performed with the disclosed compositions to produce engineered cells and organisms as
discussed herein. Furthermore, cloning strategies can be used to produce desired organisms,
which carry one or more of the disclosed compositions.

[90] Also disclosed are methods of treating a subject having pain comprising
administering any of the disclosed compounds and compositions.

[91] Delivery of the disclosed constructs to terminally differentiated cells is also
disclosed. Disclosed is a pseudotyped feline immunodeficiency virus (FIV) for μ-opioid
receptor delivery to terminally differentiated cells. Stable expression of the therapeutic gene
aids prolonged expression, enhancing treatment efficacy and contributing to long-term
therapeutic outcomes. The backbone FIV system has been shown to effectively incorporate, due
to its lentiviral properties, the transgene of interest into the host's genome, allowing for stable
gene expression (Poeschla et al., 1998). Disclosed herein is stable expression of the reporter
gene lacZ in N2a cells, following perinatal systemic FIV(lacZ) administration.

[92] In certain embodiments the constructs become an integrated product with the
genome of the host. For example, lentiviruses, such as HIV and FIV, have the characteristic of
transfecting the therapeutic gene into the host chromosome, thus forming an integrated product.
In certain embodiments, the requirement is that the vectors allow for expression in the
periphery of the cell, such as the nerve cell, and/or at or near the point of pain. The contrast to
integrated products is episomal products, which can also be produced using, for example, HSV
and AV vectors. Thus, transient expression can be beneficial. The optimal time of expression
is correlated with the amount of product produced and the amount that is needed. For example, in
certain embodiments, expression for at least 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 45, 60, 90, 120,
150, or 180 days is desirable.

[93] A model system for the study of these vectors is a knockout mouse
deficient in μ-opioid receptor.

[94] Stably transduced cells are cells in which a nucleic acid has been integrated into the cell
genome.

3. Delivery of the compositions to cells 

[95] Delivery can be applied, in general, via local or systemic routes of
administration. Local administration includes virus injection directly into the region or organ of
interest, versus intravenous (IV) or intraperitoneal (IP) injections (systemic) aiming at viral
delivery to multiple sites and organs via the blood circulation. Previous research on the effects
of local administration demonstrated gene expression limited to the site/organ of the injection,
which did not extend to the rest of the body (Daly et al., 1999a; Kordower et al., 1999).
Furthermore, previous studies have demonstrated successful global gene transfer to multiple
tissues and organs in rodents and primates following viral IV and IP injections (Daly et al.,
1999b; Tarantal et al., 2001; McCormack et al., 2001; Lipschutz et al., 2001). As disclosed herein,
IP injection of FIV(lacZ) in mice of adult (3 months old) as well as of perinatal age (P4)
resulted in global transfer and expression of the reporter gene lacZ in brain, liver, spleen and
kidney. As also disclosed, the levels of expression achieved via IP injections were superior to
those acquired following local administration directly into the liver.

[96] There are a number of compositions and methods which can be used to deliver
nucleic acids to cells, either in vitro or in vivo. These methods and compositions can largely be
broken down into two classes: viral based delivery systems and non-viral based delivery
systems. For example, the nucleic acids can be delivered through a number of direct delivery
systems such as electroporation, lipofection, calcium phosphate precipitation, plasmids, viral
vectors, viral nucleic acids, phage nucleic acids, phages, cosmids, or via transfer of genetic
material in cells or carriers such as cationic liposomes. Appropriate means for transfection,
including viral vectors, chemical transfectants, or physico-mechanical methods such as
electroporation and direct diffusion of DNA, are described by, for example, Wolff, J. A., et al.,
Science, 247, 1465-1468, (1990); and Wolff, J. A. Nature, 352, 815-818, (1991). Such
methods are well known in the art and readily adaptable for use with the compositions and
methods described herein. In certain cases, the methods will be modified to specifically
function with large DNA molecules. Further, these methods can be used to target certain
diseases and cell populations by using the targeting characteristics of the carrier.

a) Nucleic acid based delivery systems

[97] Transfer vectors can be any nucleotide construction used to deliver genes into
cells (e.g., a plasmid), or as part of a general strategy to deliver genes, e.g., as part of
recombinant retrovirus or adenovirus (Ram et al. Cancer Res. 53:83-88, (1993)).

[98] As used herein, plasmid or viral vectors are agents that transport the disclosed
nucleic acids, such as the μ-opioid receptor construct, into the cell without degradation and
include a promoter yielding expression of the μ-opioid receptor encoding sequences in the cells
into which they are delivered. In some embodiments the vectors for the μ-opioid receptor constructs
are derived from either a virus, retrovirus, or lentivirus. Viral vectors can be, for example,
Adenovirus, Adeno-associated virus, Herpes virus, Vaccinia virus, Polio virus, AIDS virus,
neuronal trophic virus, Sindbis and other RNA viruses, including these viruses with the HIV
backbone, and lentiviruses. Also preferred are any viral families which share the properties of
these viruses which make them suitable for use as vectors. Retroviruses include Murine
Moloney Leukemia virus, MMLV, and retroviruses that express the desirable properties of
MMLV as a vector. Retroviral vectors are able to carry a larger genetic payload, i.e., a
transgene, such as the disclosed μ-opioid receptor constructs or marker gene, than other viral
vectors, and for this reason are a commonly used vector. However, they are not as useful in
non-proliferating cells. Adenovirus vectors are relatively stable and easy to work with, have
high titers, can be delivered in aerosol formulation, and can transfect non-dividing cells.
Pox viral vectors are large and have several sites for inserting genes; they are thermostable and
can be stored at room temperature. A preferred embodiment is a viral vector which has been
engineered so as to suppress the immune response of the host organism, elicited by the viral
antigens. Preferred vectors of this type will carry coding regions for Interleukin 8 or 10.

[99] Viral vectors can have higher transduction abilities (ability to introduce genes)
than chemical or physical methods of introducing genes into cells. Typically, viral vectors
contain nonstructural early genes, structural late genes, an RNA polymerase III transcript,
inverted terminal repeats necessary for replication and encapsidation, and promoters to control
the transcription and replication of the viral genome. When engineered as vectors, viruses
typically have one or more of the early genes removed and a gene or gene/promoter cassette is
inserted into the viral genome in place of the removed viral DNA. Constructs of this type can
carry up to about 8 kb of foreign genetic material. The necessary functions of the removed early
genes are typically supplied by cell lines which have been engineered to express the gene
products of the early genes in trans.

(1) Retroviral Vectors 

[100] A retrovirus is an animal virus belonging to the virus family of Retroviridae,
including any types, subfamilies, genus, or tropisms. Retroviral vectors, in general, are
described by Verma, I.M., Retroviral vectors for gene transfer, In Microbiology-1985,
American Society for Microbiology, pp. 229-232, Washington, (1985), which is incorporated
by reference herein. Examples of methods for using retroviral vectors for gene therapy are
described in U.S. Patent Nos. 4,868,116 and 4,980,286; PCT applications WO 90/02806 and
WO 89/07136; and Mulligan, (Science 260:926-932 (1993)); the teachings of which are
incorporated herein by reference.

[101] A retrovirus is essentially a package which has packed into it nucleic acid cargo.
The nucleic acid cargo carries with it a packaging signal, which ensures that the replicated
daughter molecules will be efficiently packaged within the package coat. In addition to the
package signal, there are a number of molecules which are needed in cis for the replication and
packaging of the replicated virus. Typically a retroviral genome contains the gag, pol, and env
genes which are involved in the making of the protein coat. It is the gag, pol, and env genes
which are typically replaced by the foreign DNA that is to be transferred to the target cell.
Retrovirus vectors typically contain a packaging signal for incorporation into the package coat, a
sequence which signals the start of the gag transcription unit, elements necessary for reverse
transcription, including a primer binding site to bind the tRNA primer of reverse transcription,
terminal repeat sequences that guide the switch of RNA strands during DNA synthesis, a purine
rich sequence 5' to the 3' LTR that serves as the priming site for the synthesis of the second
strand of DNA synthesis, and specific sequences near the ends of the LTRs that enable the
insertion of the DNA state of the retrovirus into the host genome. The removal of the
gag, pol, and env genes allows for about 8 kb of foreign sequence to be inserted into the viral
genome, become reverse transcribed, and upon replication be packaged into a new retroviral
particle. This amount of nucleic acid is sufficient for the delivery of one to many genes
depending on the size of each transcript. It is preferable to include either positive or negative
selectable markers along with other genes in the insert.

[102] Since the replication machinery and packaging proteins in most retroviral vectors
have been removed (gag, pol, and env), the vectors are typically generated by placing them into
a packaging cell line. A packaging cell line is a cell line which has been transfected or
transformed with a retrovirus that contains the replication and packaging machinery, but lacks
any packaging signal. When the vector carrying the DNA of choice is transfected into these cell
lines, the vector containing the gene of interest is replicated and packaged into new retroviral
particles, by the machinery provided in trans by the helper cell. The genomes for the machinery
are not packaged because they lack the necessary signals.

(2) Adenoviral Vectors 

[103] The construction of replication-defective adenoviruses has been described
(Berkner et al., J. Virology 61:1213-1220 (1987); Massie et al., Mol. Cell. Biol. 6:2872-2883
(1986); Haj-Ahmad et al., J. Virology 57:267-274 (1986); Davidson et al., J. Virology
61:1226-1239 (1987); Zhang "Generation and identification of recombinant adenovirus by
liposome-mediated transfection and PCR analysis" BioTechniques 15:868-872 (1993)). The
benefit of the use of these viruses as vectors is that they are limited in the extent to which they
can spread to other cell types, since they can replicate within an initial infected cell, but are
unable to form new infectious viral particles. Recombinant adenoviruses have been shown to
achieve high efficiency gene transfer after direct, in vivo delivery to airway epithelium,
hepatocytes, vascular endothelium, CNS parenchyma and a number of other tissue sites (Morsy,
J. Clin. Invest. 92:1580-1586 (1993); Kirshenbaum, J. Clin. Invest. 92:381-387 (1993);
Roessler, J. Clin. Invest. 92:1085-1092 (1993); Moullier, Nature Genetics 4:154-159 (1993);
La Salle, Science 259:988-990 (1993); Gomez-Foix, J. Biol. Chem. 267:25129-25134
(1992); Rich, Human Gene Therapy 4:461-476 (1993); Zabner, Nature Genetics 6:75-83
(1994); Guzman, Circulation Research 73:1201-1207 (1993); Bout, Human Gene Therapy
5:3-10 (1994); Zabner, Cell 75:207-216 (1993); Caillaud, Eur. J. Neuroscience 5:1287-1291
(1993); and Ragot, J. Gen. Virology 74:501-507 (1993)). Recombinant adenoviruses achieve
gene transduction by binding to specific cell surface receptors, after which the virus is
internalized by receptor-mediated endocytosis, in the same manner as wild type or
replication-defective adenovirus (Chardonnet and Dales, Virology 40:462-477 (1970); Brown and
Burlingham, J. Virology 12:386-396 (1973); Svensson and Persson, J. Virology 55:442-449
(1985); Seth, et al., J. Virol. 51:650-655 (1984); Seth, et al., Mol. Cell. Biol. 4:1528-1533
(1984); Varga et al., J. Virology 65:6061-6070 (1991); Wickham et al., Cell 73:309-319
(1993)).

[104] A viral vector can be one based on an adenovirus which has had the E1 gene
removed and these virions are generated in a cell line such as the human 293 cell line. In
another preferred embodiment both the E1 and E3 genes are removed from the adenovirus
genome.

(3) Adeno-associated viral vectors

[105] Another type of viral vector is based on an adeno-associated virus (AAV). This
defective parvovirus is a preferred vector because it can infect many cell types and is
nonpathogenic to humans. AAV type vectors can transport about 4 to 5 kb and wild type AAV
is known to stably insert into chromosome 19. Vectors which contain this site specific
integration property are preferred. An especially preferred embodiment of this type of vector is
the P4.1 C vector produced by Avigen, San Francisco, CA, which can contain the herpes
simplex virus thymidine kinase gene, HSV-tk, and/or a marker gene, such as the gene encoding
the green fluorescent protein, GFP.

[106] In another type of AAV virus, the AAV contains a pair of inverted terminal
repeats (ITRs) which flank at least one cassette containing a promoter which directs cell-specific
expression operably linked to a heterologous gene. Heterologous in this context refers to any
nucleotide sequence or gene which is not native to the AAV or B19 parvovirus.

[107] Typically the AAV and B19 coding regions have been deleted, resulting in a safe,
noncytotoxic vector. The AAV ITRs, or modifications thereof, confer infectivity and
site-specific integration, but not cytotoxicity, and the promoter directs cell-specific expression.
United States Patent No. 6,261,834 is herein incorporated by reference for material related to
the AAV vector.

[108] The vectors of the present invention thus provide DNA molecules which are
capable of integration into a mammalian chromosome without substantial toxicity.

[109] The inserted genes in viral and retroviral systems usually contain promoters and/or
enhancers to help control the expression of the desired gene product. A promoter is generally a
sequence or sequences of DNA that function when in a relatively fixed location in regard to the
transcription start site. A promoter contains core elements required for basic interaction of
RNA polymerase and transcription factors, and can contain upstream elements and response
elements.

(4) Lentiviral vectors

[01] The vectors can be lentiviral vectors, including but not limited to, SIV vectors,
HIV vectors or a hybrid construct of these vectors, including viruses with the HIV backbone.
These vectors also include first, second and third generation lentiviruses. Third generation
lentiviruses have lentiviral packaging genes split into at least 3 independent plasmids or
constructs. Also, vectors can be from any viral family that shares the properties of these viruses which
make them suitable for use as vectors. Lentiviral vectors are a special type of retroviral vector
which are typically characterized by having a long incubation period for infection. Furthermore,
lentiviral vectors can infect non-dividing cells. Lentiviral vectors are based on the nucleic acid
backbone of a virus from the lentiviral family of viruses. Typically, a lentiviral vector contains
the 5' and 3' LTR regions of a lentivirus, such as SIV and HIV. Lentiviral vectors also typically
contain the Rev Responsive Element (RRE) of a lentivirus, such as SIV and HIV.

(a) Feline immunodeficiency viral vectors

[110] One type of vector that the disclosed constructs can be delivered in is the VSV-G
pseudotyped Feline Immunodeficiency Virus system developed by Poeschla et al. (1998). This
lentivirus has been shown to efficiently infect dividing, growth arrested as well as post-mitotic
cells. Furthermore, due to its lentiviral properties, it allows for incorporation of the transgene
into the host's genome, leading to stable gene expression. This is a 3-vector system, whereby
each confers distinct instructions: the FIV vector carries the transgene of interest and lentiviral
apparatus with mutated packaging and envelope genes. A vesicular stomatitis virus
G-glycoprotein vector (VSV-G; Burns et al., 1993) contributes to the formation of the viral
envelope in trans. The third vector confers packaging instructions in trans (Poeschla et al.,
1998). FIV production is accomplished in vitro following co-transfection of the aforementioned
vectors into 293-T cells. The FIV-rich supernatant is then collected, filtered and can be used
directly or following concentration by centrifugation. Titers routinely range between 10^ - 10''
btu/ml.

(5) Packaging vectors 

[111] As discussed above, retroviral vectors are based on retroviruses which contain a
number of different sequence elements that control things as diverse as integration of the virus,
replication of the integrated virus, replication of un-integrated virus, cellular invasion, and
packaging of the virus into infectious particles. While the vectors in theory could contain all of
their necessary elements, as well as an exogenous gene element (if the exogenous gene element
is small enough), typically many of the necessary elements are removed. Since all of the
packaging and replication components have been removed from the typical retroviral, including
lentiviral, vectors which will be used within a subject, the vectors need to be packaged into the
initial infectious particle through the use of packaging vectors and packaging cell lines.
Typically retroviral vectors have been engineered so that the myriad functions of the retrovirus
are separated onto at least two vectors, a packaging vector and a delivery vector. This type of
system then requires the presence of all of the vectors providing all of the elements in the same
cell before an infectious particle can be produced. The packaging vector typically carries the
structural and replication genes derived from the retrovirus, and the delivery vector is the vector
that carries the exogenous gene element that is preferably expressed in the target cell. These
types of systems can split the packaging functions of the packaging vector into multiple vectors,
e.g., third-generation lentivirus systems. Dull, T. et al., "A Third-generation lentivirus vector
with a conditional packaging system" J. Virol 72(11):8463-71 (1998)
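
The requirement that all vectors be present in the same cell before an infectious particle can be produced can be illustrated as a simple coverage check. This is a hedged sketch, not a model of any actual packaging system: the three function labels ("transgene", "packaging", "envelope") and the vector names in the usage note are hypothetical simplifications chosen to mirror the split described in the text.

```python
# Hypothetical, simplified set of functions that must all be supplied
# within one producer cell (an assumption made for illustration).
REQUIRED_FUNCTIONS = frozenset({"transgene", "packaging", "envelope"})


def provided_functions(vectors_in_cell: dict) -> set:
    """Union of the functions supplied by every vector present in the cell."""
    provided = set()
    for functions in vectors_in_cell.values():
        provided |= set(functions)
    return provided


def missing_functions(vectors_in_cell: dict) -> list:
    """Required functions that no co-transfected vector supplies."""
    return sorted(REQUIRED_FUNCTIONS - provided_functions(vectors_in_cell))


def can_produce_particles(vectors_in_cell: dict) -> bool:
    """True only when every required function is supplied in the same cell."""
    return not missing_functions(vectors_in_cell)
```

With a hypothetical three-vector co-transfection such as `{"delivery": {"transgene"}, "packaging": {"packaging"}, "env": {"envelope"}}`, `can_produce_particles` returns `True`; omitting the envelope vector makes it return `False`, and `missing_functions` reports `["envelope"]`.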

[112] Retroviruses typically contain an envelope protein (env). The Env protein is in
essence the protein which surrounds the nucleic acid cargo. Furthermore, cellular infection
specificity is based on the particular Env protein associated with a typical retrovirus. In typical
packaging vector/delivery vector systems, the Env protein is expressed from a separate vector
than, for example, the protease (pro) or integrase (in) proteins.

(6) Packaging cell lines 

[113] The vectors are typically generated by placing them into a packaging cell line. A
packaging cell line is a cell line which has been transfected or transformed with a retrovirus that
contains the replication and packaging machinery, but lacks any packaging signal. When the
vector carrying the DNA of choice is transfected into these cell lines, the vector containing the
gene of interest is replicated and packaged into new retroviral particles, by the machinery
provided in trans by the helper cell. The genomes for the machinery are not packaged because
they lack the necessary signals. One type of packaging cell line is a 293 cell line.

(7) Large payload viral vectors 

[114] Molecular genetic experiments with large human herpesviruses have provided a
means whereby large heterologous DNA fragments can be cloned, propagated and established in
cells permissive for infection with herpesviruses (Sun et al., Nature genetics 8: 33-41, 1994;
Cotter and Robertson, Curr Opin Mol Ther 5: 633-644, 1999). These large DNA viruses
(herpes simplex virus (HSV) and Epstein-Barr virus (EBV)) have the potential to deliver
fragments of human heterologous DNA > 150 kb to specific cells. EBV recombinants can
maintain large pieces of DNA in the infected B-cells as episomal DNA. Individual clones
carrying human genomic inserts up to 330 kb appeared genetically stable. The maintenance of
these episomes requires a specific EBV nuclear protein, EBNA1, constitutively expressed
during infection with EBV. Additionally, these vectors can be used for transfection, where large
amounts of protein can be generated transiently in vitro. Herpesvirus amplicon systems are also
being used to package pieces of DNA > 220 kb and to infect cells that can stably maintain DNA
as episomes.
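
The insert sizes quoted in this section (about 8 kb for retroviral constructs, about 4 to 5 kb for AAV, and more than 150 kb for HSV/EBV) suggest a simple screening step when choosing a vector class for a given transgene cassette. The sketch below is illustrative only: the dictionary keys and function name are hypothetical, and the HSV/EBV entry uses the stated "> 150 kb" figure as a conservative bound rather than a true upper limit.

```python
# Approximate insert capacities in kilobases, taken from the figures
# stated in the text. The HSV/EBV value is the "> 150 kb" figure used
# as a conservative bound (an assumption), not a measured maximum.
APPROX_CAPACITY_KB = {
    "retrovirus": 8.0,   # "about 8 kb of foreign sequence"
    "AAV": 5.0,          # "about 4 to 5 kb"
    "HSV/EBV": 150.0,    # "> 150 kb"
}


def vectors_that_fit(insert_kb: float) -> list:
    """Names of vector classes whose stated capacity covers the insert."""
    if insert_kb <= 0:
        raise ValueError("insert size must be positive")
    return sorted(name for name, capacity in APPROX_CAPACITY_KB.items()
                  if insert_kb <= capacity)
```

Under these assumptions, a hypothetical 6 kb cassette would be screened out for AAV but retained for the retroviral and HSV/EBV classes. Capacity is only one criterion; integration behavior and target cell type, discussed elsewhere in this section, also bear on the choice.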

[115] Other useful systems include, for example, replicating and host-restricted
non-replicating vaccinia virus vectors.

b) Non-nucleic acid based systems

[116] The disclosed compositions can be delivered to the target cells in a variety of
ways. For example, the compositions can be delivered through electroporation, or through
lipofection, or through calcium phosphate precipitation. The delivery mechanism chosen will
depend in part on the type of cell targeted and whether the delivery is occurring, for example, in
vivo or in vitro.

[117] Thus, the compositions can comprise, in addition to the disclosed constructs or
vectors, for example, lipids such as liposomes, such as cationic liposomes (e.g., DOTMA,
DOPE, DC-cholesterol) or anionic liposomes. Liposomes can further comprise proteins to
facilitate targeting a particular cell, if desired. A composition comprising a
compound and a cationic liposome can be administered to the blood afferent to a target organ or
inhaled into the respiratory tract to target cells of the respiratory tract. Regarding liposomes,
see, e.g., Brigham et al. Am. J. Resp. Cell Mol. Biol. 1:95-100 (1989); Felgner et al. Proc.
Natl. Acad. Sci. USA 84:7413-7417 (1987); U.S. Pat. No. 4,897,355. Furthermore, the
compound can be administered as a component of a microcapsule that can be targeted to
specific cell types, such as macrophages, or where the diffusion of the compound or delivery of
the compound from the microcapsule is designed for a specific rate or dosage.

[118] In the methods described above which include the administration and uptake of
exogenous DNA into the cells of a subject (i.e., gene transduction or transfection), delivery of
the compositions to cells can be via a variety of mechanisms. As one example, delivery can be
via a liposome, using commercially available liposome preparations such as LIPOFECTIN,
LIPOFECTAMINE (GIBCO-BRL, Inc., Gaithersburg, MD), SUPERFECT (Qiagen, Inc.
Hilden, Germany) and TRANSFECTAM (Promega Biotec, Inc., Madison, WI), as well as other
liposomes developed according to procedures standard in the art. In addition, the nucleic acid or
vector of this invention can be delivered in vivo by electroporation, the technology for which is
available from Genetronics, Inc. (San Diego, CA) as well as by means of a SONOPORATION
machine (ImaRx Pharmaceutical Corp., Tucson, AZ).

[119] The materials can be in solution or suspension (for example, incorporated into
microparticles, liposomes, or cells). These can be targeted to a particular cell type via
antibodies, receptors, or receptor ligands. The following references are examples of the use of
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem.,
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br.
J. Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al.,
Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog.
Reviews, 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol., 42:2062-2065, (1991)).
These techniques can be used for a variety of other specific cell types. Vehicles such as
"stealth" and other antibody conjugated liposomes (including lipid mediated drug targeting to
colonic carcinoma), receptor mediated targeting of DNA through cell specific ligands,
lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting of
murine glioma cells in vivo can also be used. The following references are examples of the use of this
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research,
49:6214-6220, (1989); and Litzinger and Huang, Biochimica et Biophysica Acta, 1104:179-187,
(1992)). In general, receptors are involved in pathways of endocytosis, either constitutive or
ligand induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated
vesicles, pass through an acidified endosome in which the receptors are sorted, and then either
recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The
internalization pathways serve a variety of functions, such as nutrient uptake, removal of
activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins,
dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow
more than one intracellular pathway, depending on the cell type, receptor concentration, type of
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of
receptor-mediated endocytosis have been reviewed (Brown and Greene, DNA and Cell Biology
10:6, 399-409 (1991)).

[120] Nucleic acids that are delivered to cells and are to be integrated into the host
cell genome typically contain integration sequences. These sequences are often viral related
sequences, particularly when viral based systems are used. These viral integration systems can
also be incorporated into nucleic acids which are to be delivered using a non-nucleic acid based
system of delivery, such as a liposome, so that the nucleic acid contained in the delivery system
can become integrated into the host genome.

[121] Other general techniques for integration into the host genome include, for
example, systems designed to promote homologous recombination with the host genome. These
systems typically rely on sequence flanking the nucleic acid to be expressed that has enough
homology with a target sequence within the host cell genome that recombination between the
vector nucleic acid and the target nucleic acid takes place, causing the delivered nucleic acid to
be integrated into the host genome. These systems and the methods necessary to promote
homologous recombination are known to those of skill in the art.

c) In vivo/ex vivo

[122] As described herein, the compositions can be administered in a pharmaceutically
acceptable carrier and can be delivered to the subject's cells in vivo and/or ex vivo by a variety
of mechanisms well known in the art (e.g., uptake of naked DNA, liposome fusion,
intramuscular injection of DNA via a gene gun, endocytosis and the like).

[123] If ex vivo methods are employed, cells or tissues can be removed and maintained
outside the body according to standard protocols well known in the art. The compositions can
be introduced into the cells via any gene transfer mechanism, such as, for example, calcium
phosphate mediated gene delivery, electroporation, microinjection or proteoliposomes. The
transduced cells can then be infused (e.g., in a pharmaceutically acceptable carrier) or
homotopically transplanted back into the subject per standard methods for the cell or tissue type.
Standard methods are known for transplantation or infusion of various cells into a subject.

[124] If in vivo delivery methods are performed, the methods can be designed to deliver
the nucleic acid constructs directly to a particular cell type, via any delivery mechanism, such as
intra-peritoneal injection of a vector construct. In this type of delivery situation, the nucleic acid
constructs can be delivered to any type of tissue, for example, brain or neural or muscle. The
nucleic acid constructs can also be delivered such that they generally reach
more than one type of cell. This type of delivery can be accomplished by, for
example, injecting the constructs intraperitoneally into the flank of the organism. (See Example
2 and figures 8-10). In certain delivery methods, the timing of the delivery is monitored. For
example, the nucleic acid constructs can be delivered at the perinatal stage of the recipient's life
or at the adult stage.

[125] The various vectors delivering the opioid receptors, such as the μ-opioid
receptor, can be delivered to differentiated cells. For example, cells that are quiescent can be
targeted with the disclosed vectors in certain embodiments. For example, nerve cells, which are
no longer dividing, or are dividing very slowly, can be transfected with the disclosed
compositions in certain embodiments. The nucleic acids can be delivered peripherally in certain
embodiments and can be delivered by injection at a site distal to the body of the cell. For
example, pain may be initiated at a point in the foot of an organism, but the body of the nerve
transmitting the pain signal will be located at or near the spinal cord. In certain embodiments,
the compositions can be delivered at the foot, transfecting the distal part of the nerve, including
the most distal part of the nerve. Transfection can take place along the full length of the cell,
however. In certain embodiments, the vectors are delivered by injection at a site distal to a
nerve body, or, for example, at the point of the pain with regard to where the body of the nerve
is located.

4. Expression systems 

[126] The nucleic acids that are delivered to cells typically contain expression
controlling systems. For example, the inserted genes in viral and retroviral systems usually
contain promoters, and/or enhancers to help control the expression of the desired gene product.
A promoter is generally a sequence or sequences of DNA that function when in a relatively
fixed location in regard to the transcription start site. A promoter contains core elements
required for basic interaction of RNA polymerase and transcription factors, and can contain
upstream elements and response elements.

a) Promoters and Enhancers 

[127] Preferred promoters controlling transcription from vectors in mammalian host
cells can be obtained from various sources, for example, the genomes of viruses such as:
polyoma, Simian Virus 40 (SV40), adenovirus, retroviruses, hepatitis-B virus and most
preferably cytomegalovirus, or from heterologous mammalian promoters, e.g. beta actin
promoter. The early and late promoters of the SV40 virus are conveniently obtained as an SV40
restriction fragment which also contains the SV40 viral origin of replication (Fiers et al., Nature,
273: 113 (1978)). The immediate early promoter of the human cytomegalovirus is conveniently
obtained as a HindIII E restriction fragment (Greenway, P.J. et al., Gene 18: 355-360 (1982)).
Of course, promoters from the host cell or related species also are useful herein.

[128] Enhancer generally refers to a sequence of DNA that functions at no fixed
distance from the transcription start site and can be either 5' (Laimins, L. et al., Proc. Natl.
Acad. Sci. 78: 993 (1981)) or 3' (Lusky, M.L., et al., Mol. Cell Bio. 3: 1108 (1983)) to the
transcription unit. Furthermore, enhancers can be within an intron (Banerji, J.L. et al., Cell 33:
729 (1983)) as well as within the coding sequence itself (Osborne, T.F., et al., Mol. Cell Bio.
4: 1293 (1984)). They are usually between 10 and 300 bp in length, and they function in cis.
Enhancers function to increase transcription from nearby promoters. Enhancers also often
contain response elements that mediate the regulation of transcription. Promoters can also
contain response elements that mediate the regulation of transcription. Enhancers often
determine the regulation of expression of a gene. While many enhancer sequences are now
known from mammalian genes (globin, elastase, albumin, α-fetoprotein and insulin), typically
one will use an enhancer from a eukaryotic cell virus for general expression. Preferred
examples are the SV40 enhancer on the late side of the replication origin (bp 100-270), the
cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the
replication origin, and adenovirus enhancers.

[129] The promoter and/or enhancer can be specifically activated either by light or
specific chemical events which trigger their function. Systems can be regulated by reagents
such as tetracycline and dexamethasone. There are also ways to enhance viral vector gene
expression by exposure to irradiation, such as gamma irradiation, or alkylating chemotherapy
drugs.

[130] In certain embodiments the promoter and/or enhancer region can act as a
constitutive promoter and/or enhancer to maximize expression of the region of the transcription
unit to be transcribed. In certain constructs the promoter and/or enhancer region can be active in
all eukaryotic cell types, even if it is only expressed in a particular type of cell at a particular
time. A preferred promoter of this type is the CMV promoter (650 bases). Other preferred
promoters are SV40 promoters, cytomegalovirus (full length promoter), and retroviral vector LTR.

[131] It has been shown that all specific regulatory elements can be cloned and used to
construct expression vectors that are selectively expressed in specific cell types such as
melanoma cells. The glial fibrillary acidic protein (GFAP) promoter has been used to
selectively express genes in cells of glial origin.

[132] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant,
animal, human or nucleated cells) can also contain sequences necessary for the termination of
transcription which can affect mRNA expression. These regions are transcribed as
polyadenylated segments in the untranslated portion of the mRNA encoding tissue factor
protein. The 3' untranslated regions also include transcription termination sites. It is preferred
that the transcription unit also contains a polyadenylation region. One benefit of this region is
that it increases the likelihood that the transcribed unit will be processed and transported like
mRNA. The identification and use of polyadenylation signals in expression constructs is well
established. It is preferred that homologous polyadenylation signals be used in the transgene
constructs. In certain transcription units, the polyadenylation region is derived from the SV40
early polyadenylation signal and consists of about 400 bases. It is also preferred that the
transcribed units contain other standard sequences alone or in combination with the above
sequences to improve expression from, or stability of, the construct.

b) Constitutive promoters

[133] In certain embodiments the promoters are constitutive promoters. This can be
any promoter that causes transcription regulation in the absence of the addition of other factors.
Examples of this type of promoter are the CMV promoter and the beta actin promoter, as well as
others discussed herein. In certain embodiments the promoter can consist of fusions of one or
more different types of promoters. For example, the regulatory regions of the CMV promoter
and the beta actin promoter are well known and understood, examples of which are disclosed
herein. Parts of these promoters can be fused together to, for example, produce a CMV-beta
actin fusion promoter. It is understood that this type of promoter has a CMV component and a
beta actin component. These components can function independently as promoters, and thus
are themselves considered beta actin promoters and CMV promoters. A promoter can be any
portion of a known promoter that causes promoter activity. It is well understood that many
promoters, including the CMV and beta actin promoters, have functional domains which are
understood and that these can be used as a beta actin promoter or CMV promoter. Furthermore,
these domains can be determined. There are many CMV promoter variations that exist, as well
as beta actin promoters, and fusion promoters. These promoters can be compared, and for
example, functional regions delineated, as described herein. Furthermore, each of these
sequences can function independently or together in any combination to provide a promoter
region for the disclosed nucleic acids.

c) Non-constitutive promoters

[134] The promoters can also be non-constitutive promoters, such as cell specific
promoters. These are promoters that are turned on at a specific time in development or stage, or
in a particular type of cell, such as a cardiac cell, or neural cell, or a bone cell. Some examples of
cell specific promoters are the neural enolase specific (NSE) promoter, the COL1A1
procollagen promoter, and the CD11b promoter (PBMC-microglia/macrophage/monocyte
specific promoter).

[02] It is understood that the recombinant systems can be expressed in a tissue-
specific manner. It is understood that tissue specific expression can occur due to the presence of
a tissue-specific promoter. Typically, proteins under control of a tissue-specific promoter are
transcribed when the promoter becomes active by virtue of being present in the tissue for which
it is specific. Therefore, all cells can encode for a particular gene without global expression. As
such, labeled proteins can be shown to be present in certain tissues without expression in other
nearby tissues that may complicate results or expression of proteins in tissues where expression
may be detrimental to the host. Disclosed are methods wherein the cre recombinase is under the
control of the EIIA promoter, a promoter specific for breast tissue, such as the WAP promoter, a
promoter specific for ovarian tissue, such as the ACTB promoter, or a promoter specific for
bone tissue, such as osteocalcin. Any tissue-specific promoter can be used. Promoters specific
for prostate, testis, and neural are also disclosed. Examples of some tissue-specific promoters
include but are not limited to MUC1, EIIA, ACTB, WAP, bHLH-EC2, HOXA-1, Alpha-fetoprotein
(AFP), opsin, CR1/2, Fc-γ-Receptor 1 (Fc-γ-R1), MMTVD-LTR, the human insulin
promoter, Pdha-2, rat neuron-specific enolase. For example, use of the AFP promoter creates
specificity for the liver. Another example, HOXA-1 is a neuronal tissue specific promoter, and
as such, proteins expressed under the control of HOXA-1 are only expressed in neuronal tissue.
(All of which are herein incorporated by reference at least for the sequence of the promoters
and related sequences.)

[135] Other cell specific promoters can be found in (Sutcliffe, J.G. (1988), Rev.
Neuroscience 11, 157-198). For example, when transfecting nerve cells, there are a variety of
nerve specific promoters, such as the neuron specific enolase promoter. Other examples of
neuron specific promoters would be the Tau promoter, Synapsin I (Hoesche, C., Sauerwald, A.,
et al., (1993) J. Biol. Chem. 268, 26494-26502) and II (Chin, L.-S. et al., (1994), J. Biol. Chem.
269, 18507-18513) promoters, the amino acid decarboxylase (AADC) (Albert, V., et al., (1992),
Proc. Natl. Acad. Sci. 89, 12053-12057) and FE65 (Faraonio, R., et al., (1994), Nucl. Acids Res.
11, 4876-4883) promoters. Other nerve specific promoters include the promoter for the WT1
gene (Fraizer, G., et al., (1994), J. Biol. Chem. 269, 8892-8900), neurofilament light chain
promoter (Yazdanbakhsh, K., et al., (1993) Nucl. Acids Res. 21, 455-461), and the glial
fibrillary acidic protein (Kaneko, R. & Sueoka, N. (1993) Proc. Natl. Acad. Sci. 90, 4698-
4702). (All of which are herein incorporated by reference at least for the sequence of the
promoters and related sequences.)

[136] Expression of the transgene can be targeted selectively to neurons by cloning a
neuron specific promoter, such as the NSE promoter as disclosed herein (Liu H. et al., Journal
of Neuroscience. 23(18):7143-54, 2003); tyrosine hydroxylase promoter (Kessler MA. et al.,
Brain Research. Molecular Brain Research. 112(1-2):8-23, 2003); myelin basic protein
promoter (Kessler MA. et al., Biochemical & Biophysical Research Communications.
288(4):809-18, 2001); glial fibrillary acidic protein promoter (Nolte C. et al., GLIA. 33(1):72-
86, 2001); neurofilaments gene (heavy, medium, light) promoters (Yaworsky PJ. et al., Journal
of Biological Chemistry. 272(40):25112-20, 1997). (All of which are herein incorporated by
reference at least for the sequence of the promoters and related sequences.) The NSE promoter
is disclosed in Peel AL. et al., Gene Therapy. 4(1):16-24, 1997 (SEQ ID NO:69) (plR-NT3myc;
Powell Gene Therapy Center, University of Florida, Gainesville FL).

d) Markers 

[137] The viral vectors can include nucleic acid sequence encoding a marker product.
This marker product is used to determine if the gene has been delivered to the cell and once
delivered is being expressed. Preferred marker genes are the E. coli lacZ gene, which encodes
β-galactosidase, and green fluorescent protein.

[138] In some embodiments the marker can be a selectable marker. Examples of
suitable selectable markers for mammalian cells are dihydrofolate reductase (DHFR), thymidine
kinase, neomycin, neomycin analog G418, hygromycin, and puromycin. When such selectable
markers are successfully transferred into a mammalian host cell, the transformed mammalian
host cell can survive if placed under selective pressure. There are two widely used distinct
categories of selective regimes. The first category is based on a cell's metabolism and the use of
a mutant cell line which lacks the ability to grow independent of a supplemented media. Two
examples are: CHO DHFR- cells and mouse LTK- cells. These cells lack the ability to grow
without the addition of such nutrients as thymidine or hypoxanthine. Because these cells lack
certain genes necessary for a complete nucleotide synthesis pathway, they cannot survive unless
the missing nucleotides are provided in a supplemented media. An alternative to supplementing
the media is to introduce an intact DHFR or TK gene into cells lacking the respective genes,
thus altering their growth requirements. Individual cells which were not transformed with the
DHFR or TK gene will not be capable of survival in non-supplemented media.

[139] The second category is dominant selection which refers to a selection scheme
used in any cell type and does not require the use of a mutant cell line. These schemes typically
use a drug to arrest growth of a host cell. Those cells which have a novel gene would express a
protein conveying drug resistance and would survive the selection. Examples of such dominant
selection use the drugs neomycin (Southern P. and Berg, P., J. Molec. Appl. Genet. 1: 327
(1982)), mycophenolic acid (Mulligan, R.C. and Berg, P. Science 209: 1422 (1980)) or
hygromycin (Sugden, B. et al., Mol. Cell. Biol. 5: 410-413 (1985)). The three examples
employ bacterial genes under eukaryotic control to convey resistance to the appropriate drug
G418 or neomycin (geneticin), xgpt (mycophenolic acid) or hygromycin, respectively. Others
include the neomycin analog G418 and puromycin.

e) Post transcriptional regulatory elements 

[140] The disclosed vectors can also contain post-transcriptional regulatory elements.
Post-transcriptional regulatory elements can enhance mRNA stability or enhance translation of
the transcribed mRNA. An exemplary post-transcriptional regulatory sequence is the WPRE
sequence isolated from the woodchuck hepatitis virus. (Zufferey R, et al., "Woodchuck
hepatitis virus post-transcriptional regulatory element enhances expression of transgenes
delivered by retroviral vectors," J Virol; 73:2886-92 (1999)). Post-transcriptional regulatory
elements can be positioned both 3' and 5' to the exogenous gene, but it is preferred that they are
positioned 3' to the exogenous gene.

f) Transduction efficiency elements

[141] Transduction efficiency elements are sequences that enhance the packaging and
transduction of the vector. These elements typically contain polypurine sequences. An example
of a transduction efficiency element is the ppt-cts sequence that contains the central polypurine
tract (ppt) and central terminal site (cts) from the FIV. These sequences are in the disclosed FIV
sequences herein. Each retrovirus and lentivirus can have their own ppt-cts.

g) 3' untranslated regions

[142] Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant,
animal, human or nucleated cells) can also contain sequences necessary for the termination of
transcription which can affect mRNA expression. These 3' untranslated regions are transcribed
as polyadenylated segments in the untranslated portion of the mRNA encoding the exogenous
gene. The 3' untranslated regions also include transcription termination sites. The transcription
unit also can contain a polyadenylation region. One benefit of this region is that it increases the
likelihood that the transcribed unit will be processed and transported like mRNA. The
identification and use of polyadenylation signals in expression constructs is well established.
Homologous polyadenylation signals can be used in the transgene constructs. In an embodiment
of the transcription unit, the polyadenylation region is derived from the SV40 early
polyadenylation signal and consists of about 400 bases. Transcribed units can contain other
standard sequences alone or in combination with the above sequences to improve expression
from, or stability of, the construct.

5. Sequence similarities 

[143] It is understood that as discussed herein the use of the terms homology and
identity mean the same thing as similarity. Thus, for example, if the word homology
is used between two non-natural sequences it is understood that this is not necessarily indicating
an evolutionary relationship between these two sequences, but rather is looking at the similarity
or relatedness between their nucleic acid sequences. Many of the methods for determining
homology between two evolutionarily related molecules are routinely applied to any two or
more nucleic acids or proteins for the purpose of measuring sequence similarity regardless of
whether they are evolutionarily related or not.

[144] In general, it is understood that one way to define any known variants and
derivatives or those that might arise, of the disclosed genes and proteins herein, is through
defining the variants and derivatives in terms of homology to specific known sequences. This
identity of particular sequences disclosed herein is also discussed elsewhere herein. In general,
variants of genes and proteins herein disclosed typically have at least, about 70, 71, 72, 73, 74,
75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99
percent homology to the stated sequence or the native sequence. Those of skill in the art readily
understand how to determine the homology of two proteins or nucleic acids, such as genes. For
example, the homology can be calculated after aligning the two sequences so that the homology
is at its highest level.

[145] Another way of calculating homology can be performed by published algorithms.
Optimal alignment of sequences for comparison can be conducted by the local homology
algorithm of Smith and Waterman, Adv. Appl. Math. 2: 482 (1981), by the homology alignment
algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970), by the search for similarity
method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988), by
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in
the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr.,
Madison, WI), or by inspection.
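
As an illustration of calculating homology after alignment, the following is a minimal sketch of a Needleman and Wunsch style global alignment followed by a percent identity calculation. It is not the GAP or BESTFIT implementation; the scoring values (match +1, mismatch -1, gap -1), the function names, and the choice to divide matches by the aligned length are assumptions made only for this example.

```python
def needleman_wunsch(a, b, match=1, mismatch=-1, gap=-1):
    """Globally align sequences a and b; return the two gapped strings.

    Scoring values are illustrative assumptions, not values from the text.
    """
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1])
            i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-")
            i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a, b):
    """Percent of aligned columns holding identical residues (one convention)."""
    al_a, al_b = needleman_wunsch(a, b)
    if not al_a:
        return 0.0
    matches = sum(1 for x, y in zip(al_a, al_b) if x == y and x != "-")
    return 100.0 * matches / len(al_a)
```

Other conventions divide by the length of the shorter sequence or exclude gapped columns, which is one reason different methods can report different percentages for the same pair of sequences.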

[146] The same types of homology can be obtained for nucleic acids by for example
the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad.
Sci. USA 96:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are
herein incorporated by reference for at least material related to nucleic acid alignment. It is
understood that any of the methods typically can be used and that in certain instances the results
of these various methods can differ, but the skilled artisan understands if identity is found with
at least one of these methods, the sequences would be said to have the stated identity, and be
disclosed herein.
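
The "at least one of these methods" rule can be sketched as a simple check over the results of several calculation methods. The function name, the method labels, and the numeric values below are hypothetical and serve only to illustrate the logic.

```python
def has_stated_identity(results, stated_percent):
    """Return True if any calculation method reaches the stated percent identity.

    results: mapping of method name -> percent identity reported by that method.
    """
    return any(value >= stated_percent for value in results.values())


# Hypothetical results for one pair of sequences under three methods.
example_results = {
    "Smith-Waterman": 76.0,
    "Needleman-Wunsch": 78.5,
    "Zuker": 80.0,
}
```

With these illustrative values, `has_stated_identity(example_results, 80)` is true because one method reaches 80 percent, even though the other two do not.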

[147] For example, as used herein, a sequence recited as having a particular percent
homology to another sequence refers to sequences that have the recited homology as calculated
by any one or more of the calculation methods described above. For example, a first sequence
has 80 percent homology, as defined herein, to a second sequence if the first sequence is
calculated to have 80 percent homology to the second sequence using the Zuker calculation
method even if the first sequence does not have 80 percent homology to the second sequence as
calculated by any of the other calculation methods. As another example, a first sequence has 80
percent homology, as defined herein, to a second sequence if the first sequence is calculated to
have 80 percent homology to the second sequence using both the Zuker calculation method and
the Pearson and Lipman calculation method even if the first sequence does not have 80 percent
homology to the second sequence as calculated by the Smith and Waterman calculation method,
the Needleman and Wunsch calculation method, the Jaeger calculation methods, or any of the
other calculation methods. As yet another example, a first sequence has 80 percent homology,
as defined herein, to a second sequence if the first sequence is calculated to have 80 percent
homology to the second sequence using each of the calculation methods (although, in practice,
the different calculation methods will often result in different calculated homology percentages).

6. Hybridization/selective hybridization

[148] The term hybridization typically means a sequence driven interaction between at
least two nucleic acid molecules, such as a primer or a probe and a gene. Sequence driven
interaction means an interaction that occurs between two nucleotides or nucleotide analogs or
nucleotide derivatives in a nucleotide specific manner. For example, G interacting with C or A
interacting with T are sequence driven interactions. Typically sequence driven interactions
occur on the Watson-Crick face or Hoogsteen face of the nucleotide. The hybridization of two
nucleic acids is affected by a number of conditions and parameters known to those of skill in the
art. For example, the salt concentrations, pH, and temperature of the reaction all affect whether
two nucleic acid molecules will hybridize.

[149] Parameters for selective hybridization between two nucleic acid molecules are
well known to those of skill in the art. For example, in some embodiments selective
hybridization conditions can be defined as stringent hybridization conditions. For example,
stringency of hybridization is controlled by both temperature and salt concentration of either or
both of the hybridization and washing steps. For example, the conditions of hybridization to
achieve selective hybridization can involve hybridization in high ionic strength solution (6X
SSC or 6X SSPE) at a temperature that is about 12-25°C below the Tm (the melting temperature
at which half of the molecules dissociate from their hybridization partners) followed by washing
at a combination of temperature and salt concentration chosen so that the washing temperature
is about 5°C to 20°C below the Tm. The temperature and salt conditions are readily
determined empirically in preliminary experiments in which samples of reference DNA
immobilized on filters are hybridized to a labeled nucleic acid of interest and then washed under
conditions of different stringencies. Hybridization temperatures are typically higher for DNA-
RNA and RNA-RNA hybridizations. The conditions can be used as described above to achieve
stringency, or as is known in the art. (Sambrook et al., Molecular Cloning: A Laboratory
Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989;
Kunkel et al. Methods Enzymol. 1987:154:367, 1987 which is herein incorporated by reference
for material at least related to hybridization of nucleic acids). A preferable stringent
hybridization condition for a DNA:DNA hybridization can be at about 68°C (in aqueous
solution) in 6X SSC or 6X SSPE followed by washing at 68°C. Stringency of hybridization and
washing, if desired, can be reduced accordingly as the degree of complementarity desired is
decreased, and further, depending upon the G-C or A-T richness of any area wherein variability
is searched for. Likewise, stringency of hybridization and washing, if desired, can be increased
accordingly as homology desired is increased, and further, depending upon the G-C or A-T
richness of any area wherein high homology is desired, all as known in the art.
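
As a rough illustration of how the temperature windows relate to the Tm, the sketch below estimates a Tm and derives hybridization and wash ranges from it. The Tm estimate uses the Wallace rule (2°C per A or T, 4°C per G or C), which is a common approximation for short oligonucleotides and is an assumption of this example rather than a method stated in the text; in practice the conditions are determined empirically as described above. The offsets of about 12-25°C (hybridization) and 5-20°C (wash) below the Tm are taken as our reading of the preceding paragraph, and the function names are hypothetical.

```python
def wallace_tm(seq):
    """Approximate melting temperature in degrees C for a short DNA oligo (Wallace rule)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2.0 * at + 4.0 * gc


def stringency_windows(tm):
    """Return (low, high) temperature ranges in degrees C derived from a Tm."""
    return {
        "hybridization": (tm - 25.0, tm - 12.0),  # about 12-25 C below the Tm
        "wash": (tm - 20.0, tm - 5.0),            # about 5-20 C below the Tm
    }


# Hypothetical 16-mer with 8 A/T and 8 G/C bases.
probe_tm = wallace_tm("ACGTACGTACGTACGT")
windows = stringency_windows(probe_tm)
```

A G-C rich probe yields a higher estimated Tm and therefore higher hybridization and wash temperatures, which matches the dependence on G-C or A-T richness noted above.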

[150] Another way to define selective hybridization is by looking at the amount
(percentage) of one of the nucleic acids bound to the other nucleic acid. For example, in some
embodiments selective hybridization conditions would be when at least about 60, 65, 70, 71,
72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96,
97, 98, 99, 100 percent of the limiting nucleic acid is bound to the non-limiting nucleic acid.
Typically, the non-limiting primer is in for example, 10 or 100 or 1000 fold excess. This type of
assay can be performed under conditions where both the limiting and non-limiting primer are
for example, 10 fold or 100 fold or 1000 fold below their kd, or where only one of the nucleic
acid molecules is 10 fold or 100 fold or 1000 fold or where one or both nucleic acid molecules
are above their kd.

[151] Another way to define selective hybridization is by looking at the percentage of
primer that gets enzymatically manipulated under conditions where hybridization is required to
promote the desired enzymatic manipulation. For example, in some embodiments selective
hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78,
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of
the primer is enzymatically manipulated under conditions which promote the enzymatic
manipulation, for example if the enzymatic manipulation is DNA extension, then selective
hybridization conditions would be when at least about 60, 65, 70, 71, 72, 73, 74, 75, 76, 77, 78,
79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100 percent of
the primer molecules are extended. Preferred conditions also include those suggested by the
manufacturer or indicated in the art as being appropriate for the enzyme performing the
manipulation.

[152] Just as with homology, it is understood that there are a variety of methods herein
disclosed for determining the level of hybridization between two nucleic acid molecules. It is
understood that these methods and conditions can provide different percentages of hybridization
between two nucleic acid molecules, but unless otherwise indicated meeting the parameters of
any of the methods would be sufficient. For example, if 80% hybridization was required, then as
long as hybridization occurs within the required parameters in any one of these methods it is
considered disclosed herein.

[153] It is understood that those of skill in the art understand that if a composition or
method meets any one of these criteria for determining hybridization either collectively or singly
it is a composition or method that is disclosed herein.

7. Nucleic acids 

[154] There are a variety of molecules disclosed herein that are nucleic acid based,
including for example the nucleic acids that encode, for example μ-opioid receptor, or
functional nucleic acids. The disclosed nucleic acids can be made up of for example,
nucleotides, nucleotide analogs, or nucleotide substitutes. Non-limiting examples of these and
other molecules are discussed herein. It is understood that for example, when a vector is
expressed in a cell, that the expressed mRNA will typically be made up of A, C, G, and U.
Likewise, it is understood that if, for example, an antisense molecule is introduced into a cell or
cell environment through for example exogenous delivery, it is advantageous that the antisense
molecule be made up of nucleotide analogs that reduce the degradation of the antisense
molecule in the cellular environment.

[155] A nucleotide is a molecule that contains a base moiety, a sugar moiety and a
phosphate moiety. Nucleotides can be linked together through their phosphate moieties and
sugar moieties creating an internucleoside linkage. The base moiety of a nucleotide can be
adenin-9-yl (A), cytosin-1-yl (C), guanin-9-yl (G), uracil-1-yl (U), and thymin-1-yl (T). The
sugar moiety of a nucleotide is a ribose or a deoxyribose. The phosphate moiety of a nucleotide
is pentavalent phosphate. A non-limiting example of a nucleotide would be 3'-AMP
(3'-adenosine monophosphate) or 5'-GMP (5'-guanosine monophosphate).

[156] A nucleotide analog is a nucleotide which contains some type of modification to
either the base, sugar, or phosphate moieties. Modifications to nucleotides are well known in
the art and would include for example, 5-methylcytosine (5-me-C), 5-hydroxymethyl cytosine,
xanthine, hypoxanthine, and 2-aminoadenine as well as modifications at the sugar or phosphate
moieties.

[157] Nucleotide substitutes are molecules having similar functional properties to
nucleotides, but which do not contain a phosphate moiety, such as peptide nucleic acid (PNA).
Nucleotide substitutes are molecules that will recognize nucleic acids in a Watson-Crick or
Hoogsteen manner, but which are linked together through a moiety other than a phosphate
moiety. Nucleotide substitutes are able to conform to a double helix type structure when
interacting with the appropriate target nucleic acid.

[158] It is also possible to link other types of molecules (conjugates) to nucleotides or
nucleotide analogs to enhance for example, cellular uptake. Conjugates can be chemically
linked to the nucleotide or nucleotide analogs. Such conjugates include but are not limited to
lipid moieties such as a cholesterol moiety. (Letsinger et al., Proc. Natl. Acad. Sci. USA,
1989, 86, 6553-6556).

[159] A Watson-Crick interaction is at least one interaction with the Watson-Crick face
of a nucleotide, nucleotide analog, or nucleotide substitute. The Watson-Crick face of a
nucleotide, nucleotide analog, or nucleotide substitute includes the C2, N1, and C6 positions of
a purine based nucleotide, nucleotide analog, or nucleotide substitute and the C2, N3, C4
positions of a pyrimidine based nucleotide, nucleotide analog, or nucleotide substitute.

[160] A Hoogsteen interaction is the interaction that takes place on the Hoogsteen face
of a nucleotide or nucleotide analog, which is exposed in the major groove of duplex DNA. The
Hoogsteen face includes the N7 position and reactive groups (NH2 or O) at the C6 position of
purine nucleotides.

a) Sequences 

[161] There are a variety of sequences related to μ-opioid receptor and promoter
sequences. These sequences and others are herein incorporated by reference in their entireties as
well as for individual subsequences contained therein. It is understood that there are numerous
Genbank accession sequences related to μ-opioid receptor, all of which are incorporated by
reference herein.

[162] One particular sequence set forth in SEQ ID NO:2, which is a sequence for
human μ-opioid receptor cDNA, is used herein, as an example, to exemplify the disclosed
compositions and methods. It is understood that the description related to this sequence is
applicable to any sequence related to μ-opioid receptor unless specifically indicated otherwise.
Those of skill in the art understand how to resolve sequence discrepancies and differences and
to adjust the compositions and methods relating to a particular sequence to other related
sequences. Primers and/or probes can be designed for any of the sequences disclosed herein
given the information disclosed herein and that known in the art.



WO 2004/073646    PCT/US2004/004914

[163] It is also understood for example that there are numerous vectors that can be used
to create the μ-opioid receptor construct nucleic acids.

b) Primers and probes 

[164] Disclosed are compositions including primers and probes, which are capable of
interacting with, for example, the μ-opioid receptor construct nucleic acids, as disclosed herein.
In certain embodiments the primers are used to support DNA amplification reactions.
Typically the primers will be capable of being extended in a sequence specific manner.
Extension of a primer in a sequence specific manner includes any methods wherein the
sequence and/or composition of the nucleic acid molecule to which the primer is hybridized or
otherwise associated directs or influences the composition or sequence of the product produced
by the extension of the primer. Extension of the primer in a sequence specific manner therefore
includes, but is not limited to, PCR, DNA sequencing, DNA extension, DNA polymerization,
RNA transcription, or reverse transcription. Techniques and conditions that amplify the primer
in a sequence specific manner are preferred. In certain embodiments the primers are used for
the DNA amplification reactions, such as PCR or direct sequencing. It is understood that in
certain embodiments the primers can also be extended using non-enzymatic techniques, where
for example, the nucleotides or oligonucleotides used to extend the primer are modified such
that they will chemically react to extend the primer in a sequence specific manner. Typically the
disclosed primers hybridize with, for example, the μ-opioid receptor construct nucleic acid, or
region of the μ-opioid receptor construct nucleic acids or they hybridize with the complement of
the μ-opioid receptor construct nucleic acids or complement of a region of the μ-opioid receptor
construct nucleic acids.
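
As an illustration of the last point, the following Python sketch checks whether a candidate primer would pair with a given strand or with its complement. It is only a minimal sketch: it assumes exact base-for-base matching (no mismatches or melting-temperature considerations), and the sequences and the function names `reverse_complement` and `primer_target` are hypothetical, not taken from any SEQ ID NO of this disclosure.

```python
# Illustrative sketch only: the sequences below are made up and are not taken
# from any SEQ ID NO in this disclosure.

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_target(primer: str, template: str) -> str:
    """Report which strand an exactly matching primer would hybridize with.

    A primer whose sequence occurs in `template` base-pairs with the
    complement of the template; a primer whose reverse complement occurs in
    `template` base-pairs with the template strand itself.
    """
    primer = primer.upper()
    template = template.upper()
    if primer in template:
        return "complement"
    if reverse_complement(primer) in template:
        return "template"
    return "none"


template = "ATGGACAGCAGCGCTGCCCCCACGAACGCC"   # hypothetical construct region
forward = "ATGGACAGCAGC"                      # same sequence as the 5' end of the template
reverse = reverse_complement("CCCACGAACGCC")  # pairs with the 3' end of the template

print(primer_target(forward, template))  # complement
print(primer_target(reverse, template))  # template
```

A forward/reverse pair of this kind brackets the region to be amplified, one primer annealing to each strand.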

8. Peptides 

a) Protein variants 

[165] As discussed herein there are numerous variants of the μ-opioid receptor protein
that are known and herein contemplated. In addition to the known functional species and
allelic variants of μ-opioid receptor, there are derivatives of the μ-opioid receptor protein which
also function in the disclosed methods and compositions. Protein variants and derivatives are
well understood to those of skill in the art and can involve amino acid sequence
modifications. For example, amino acid sequence modifications typically fall into one or more
of three classes: substitutional, insertional or deletional variants. Insertions include amino
and/or carboxyl terminal fusions as well as intrasequence insertions of single or multiple amino
acid residues. Insertions ordinarily will be smaller insertions than those of amino or carboxyl
terminal fusions, for example, on the order of one to four residues. Immunogenic fusion protein
derivatives, such as those described in the examples, are made by fusing a polypeptide
sufficiently large to confer immunogenicity to the target sequence by cross-linking in vitro or by
recombinant cell culture transformed with DNA encoding the fusion. Deletions are
characterized by the removal of one or more amino acid residues from the protein sequence.
Typically, no more than about from 2 to 6 residues are deleted at any one site within the protein
molecule. These variants ordinarily are prepared by site specific mutagenesis of nucleotides in
the DNA encoding the protein, thereby producing DNA encoding the variant, and thereafter
expressing the DNA in recombinant cell culture. Techniques for making substitution mutations
at predetermined sites in DNA having a known sequence are well known, for example M13
primer mutagenesis and PCR mutagenesis. Amino acid substitutions are typically of single
residues, but can occur at a number of different locations at once; insertions usually will be on
the order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30
residues. Deletions or insertions preferably are made in adjacent pairs, i.e. a deletion of 2
residues or insertion of 2 residues. Substitutions, deletions, insertions or any combination
thereof can be combined to arrive at a final construct. The mutations must not place the
sequence out of reading frame and preferably will not create complementary regions that could
produce secondary mRNA structure. Substitutional variants are those in which at least one
residue has been removed and a different residue inserted in its place. Such substitutions
generally are made in accordance with the following Tables 4 and 5 and are referred to as
conservative substitutions.

[166] TABLE 4: Amino Acid Abbreviations

Amino Acid           Abbreviations
alanine              Ala   A
alloisoleucine       AIle
arginine             Arg   R
asparagine           Asn   N
aspartic acid        Asp   D
cysteine             Cys   C
glutamic acid        Glu   E
glutamine            Gln   Q
glycine              Gly   G
histidine            His   H
isoleucine           Ile   I
leucine              Leu   L
lysine               Lys   K
phenylalanine        Phe   F
proline              Pro   P
pyroglutamic acid    pGlu
serine               Ser   S
threonine            Thr   T
tyrosine             Tyr   Y
tryptophan           Trp   W
valine               Val   V



TABLE 5: Amino Acid Substitutions

Original Residue     Exemplary Conservative Substitutions, others are known in the art.
Ala                  ser
Arg                  lys; gln
Asn                  gln; his
Asp                  glu
Cys                  ser
Gln                  asn; lys
Glu                  asp
Gly                  pro
His                  asn; gln
Ile                  leu; val
Leu                  ile; val
Lys                  arg; gln
Met                  leu; ile
Phe                  met; leu; tyr
Ser                  thr
Thr                  ser
Trp                  tyr
Tyr                  trp; phe
Val                  ile; leu
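
The substitution table above can be applied mechanically. The following Python sketch transcribes it into a lookup and tests whether a proposed replacement counts as one of the listed exemplary conservative substitutions. The names `CONSERVATIVE` and `is_conservative` are hypothetical, and since the table itself notes that other conservative substitutions are known in the art, a negative answer here only means the pair is not listed.

```python
# The substitution table transcribed as a lookup:
# original residue -> listed exemplary conservative substitutions.
CONSERVATIVE = {
    "Ala": {"Ser"},
    "Arg": {"Lys", "Gln"},
    "Asn": {"Gln", "His"},
    "Asp": {"Glu"},
    "Cys": {"Ser"},
    "Gln": {"Asn", "Lys"},
    "Glu": {"Asp"},
    "Gly": {"Pro"},
    "His": {"Asn", "Gln"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln"},
    "Met": {"Leu", "Ile"},
    "Phe": {"Met", "Leu", "Tyr"},
    "Ser": {"Thr"},
    "Thr": {"Ser"},
    "Trp": {"Tyr"},
    "Tyr": {"Trp", "Phe"},
    "Val": {"Ile", "Leu"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """True if `replacement` is listed for `original` (three-letter codes)."""
    listed = CONSERVATIVE.get(original.capitalize(), set())
    return replacement.capitalize() in listed


print(is_conservative("Val", "Ile"))  # True
print(is_conservative("Gly", "Trp"))  # False
```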



[167] Substantial changes in function or immunological identity are made by selecting
substitutions that are less conservative than those in Table 5, i.e., selecting residues that differ
more significantly in their effect on maintaining (a) the structure of the polypeptide backbone in
the area of the substitution, for example as a sheet or helical conformation, (b) the charge or
hydrophobicity of the molecule at the target site or (c) the bulk of the side chain. The
substitutions which in general are expected to produce the greatest changes in the protein
properties will be those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted
for (or by) a hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a
cysteine or proline is substituted for (or by) any other residue; (c) a residue having an
electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain,
e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine, in this
case, (e) by increasing the number of sites for sulfation and/or glycosylation.

[168] For example, the replacement of one amino acid residue with another that is
biologically and/or chemically similar is known to those skilled in the art as a conservative
substitution. For example, a conservative substitution would be replacing one hydrophobic
residue for another, or one polar residue for another. The substitutions include combinations
such as, for example, Gly, Ala; Val, Ile, Leu; Asp, Glu; Asn, Gln; Ser, Thr; Lys, Arg; and Phe,
Tyr. Such conservatively substituted variations of each explicitly disclosed sequence are
included within the mosaic polypeptides provided herein.

[169] Substitutional or deletional mutagenesis can be employed to insert sites for N-
glycosylation (Asn-X-Thr/Ser) or O-glycosylation (Ser or Thr). Deletions of cysteine or other
labile residues also may be desirable. Deletions or substitutions of potential proteolysis sites,
e.g. Arg, are accomplished for example by deleting one of the basic residues or substituting one
by glutaminyl or histidyl residues.
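
The Asn-X-Thr/Ser motif can be located in a one-letter protein sequence with a short pattern search. The Python sketch below is illustrative only: the example peptide is made up, the function name `find_n_glyc_sites` is hypothetical, and the exclusion of proline at the X position is an assumption based on the commonly applied sequon rule rather than something stated in this disclosure.

```python
import re

# Asn, then any residue except Pro (assumed rule, see lead-in), then Ser or Thr.
# The lookahead lets overlapping motifs all be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_n_glyc_sites(protein: str) -> list[int]:
    """Return 1-based positions of Asn residues that start an Asn-X-Ser/Thr motif."""
    return [m.start() + 1 for m in SEQUON.finditer(protein.upper())]


# Hypothetical peptide, not a sequence from this disclosure.
print(find_n_glyc_sites("MDSSAAPTNASNCTDA"))  # [9, 12]
```

Comparing the result before and after a proposed substitution shows whether the change introduces or removes a candidate N-glycosylation site.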

[170] Certain post-translational derivatizations are the result of the action of
recombinant host cells on the expressed polypeptide. Glutaminyl and asparaginyl residues are
frequently post-translationally deamidated to the corresponding glutamyl and aspartyl residues.
Alternatively, these residues are deamidated under mildly acidic conditions. Other post-
translational modifications include hydroxylation of proline and lysine, phosphorylation of
hydroxyl groups of seryl or threonyl residues, methylation of the α-amino groups of lysine,
arginine, and histidine side chains (T.E. Creighton, Proteins: Structure and Molecular
Properties, W. H. Freeman & Co., San Francisco pp 79-86 (1983)), acetylation of the N-
terminal amine and, in some instances, amidation of the C-terminal carboxyl.

[171] It is understood that one way to define the variants and derivatives of the
disclosed proteins herein is through defining the variants and derivatives in terms of
homology/identity to specific known sequences. For example, SEQ ID NO:1 sets forth a
particular sequence of μ-opioid receptor. Specifically disclosed are variants of these and other
proteins herein disclosed which have at least 70% or 75% or 80% or 85% or 90% or 95%
homology to the stated sequence. Those of skill in the art readily understand how to determine
the homology of two proteins. For example, the homology can be calculated after aligning the
two sequences so that the homology is at its highest level.

[172] Another way of calculating homology can be performed by published algorithms.
Optimal alignment of sequences for comparison can be conducted by the local homology
algorithm of Smith and Waterman Adv. Appl. Math. 2: 482 (1981), by the homology
alignment algorithm of Needleman and Wunsch, J. Mol. Biol. 48: 443 (1970), by the search
for similarity method of Pearson and Lipman, Proc. Natl. Acad. Sci. U.S.A. 85: 2444 (1988),
by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr.,
Madison, WI), or by inspection.
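
To make the "align, then score" idea concrete, the following Python sketch performs a basic Needleman and Wunsch style global alignment and reports percent identity. It is a minimal sketch under stated assumptions: the scoring values (match +1, mismatch -1, linear gap -1) are arbitrary illustrative choices and not those of GAP or BESTFIT, identity is computed over the full alignment length including gap columns (one of several conventions), and the function names are hypothetical.

```python
def global_align(a: str, b: str, match: int = 1, mismatch: int = -1, gap: int = -1):
    """Global alignment by dynamic programming; returns the two gapped strings."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        sub = match if i > 0 and j > 0 and a[i - 1] == b[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            out_a.append(a[i - 1]); out_b.append(b[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1]); out_b.append("-"); i -= 1
        else:
            out_a.append("-"); out_b.append(b[j - 1]); j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned positions as a percentage of the alignment length."""
    gapped_a, gapped_b = global_align(a, b)
    same = sum(x == y and x != "-" for x, y in zip(gapped_a, gapped_b))
    return 100.0 * same / len(gapped_a)


print(global_align("ACGT", "ACT"))      # ('ACGT', 'AC-T')
print(percent_identity("ACGT", "ACT"))  # 75.0
```

A variant could then be compared against a threshold such as the 70% figure mentioned above, bearing in mind that the number depends on the scoring scheme and identity convention chosen.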

[173] The same types of homology can be obtained for nucleic acids by for example
the algorithms disclosed in Zuker, M. Science 244:48-52, 1989, Jaeger et al. Proc. Natl. Acad.
Sci. USA 86:7706-7710, 1989, Jaeger et al. Methods Enzymol. 183:281-306, 1989 which are
herein incorporated by reference for at least material related to nucleic acid alignment.

[174] It is understood that the description of conservative mutations and homology can
be combined together in any combination, such as embodiments that have at least 70%
homology to a particular sequence wherein the variants are conservative mutations.

[175] As this specification discusses various proteins and protein sequences it is
understood that the nucleic acids that can encode those protein sequences are also disclosed.
This would include all degenerate sequences related to a specific protein sequence, i.e. all
nucleic acids having a sequence that encodes one particular protein sequence as well as all
nucleic acids, including degenerate nucleic acids, encoding the disclosed variants and
derivatives of the protein sequences. Thus, while each particular nucleic acid sequence may not
be written out herein, it is understood that each and every sequence is in fact disclosed and
described herein through the disclosed protein sequence. For example, one of the many nucleic
acid sequences that can encode the protein sequence set forth in SEQ ID NO:3 is set forth in
SEQ ID NO:4. Another nucleic acid sequence that encodes the same protein sequence set forth
in SEQ ID NO:3 is set forth in SEQ ID NO:8. In addition, for example, a disclosed
conservative derivative of SEQ ID NO:3 is shown in SEQ ID NO:9, where the valine (V) at
position 21 is changed to an isoleucine (I). It is understood that for this mutation, all of the
nucleic acid sequences that encode this particular derivative of the SEQ ID NO:3 polypeptide
are also disclosed. It is also understood that while no amino acid sequence indicates what
particular DNA sequence encodes that protein within an organism, where particular variants of a
disclosed protein are disclosed herein, the known nucleic acid sequence that encodes that
protein in the particular organism from which that protein arises is also known and herein
disclosed and described.
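
The size of the degenerate family described above follows directly from the genetic code. The Python sketch below counts and enumerates the DNA sequences that encode a short peptide, and repeats the count for a valine-to-isoleucine derivative. It assumes the standard genetic code; the three-residue peptide "MVL" is a made-up example and is not taken from SEQ ID NO:3, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Invert the table: amino acid -> list of codons that encode it.
CODONS_FOR = {}
for codon, aa in CODON_TABLE.items():
    CODONS_FOR.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode `peptide` (one-letter codes)."""
    total = 1
    for aa in peptide.upper():
        total *= len(CODONS_FOR[aa])
    return total


def all_encodings(peptide: str) -> list[str]:
    """Enumerate every encoding; only practical for short peptides."""
    choices = (CODONS_FOR[aa] for aa in peptide.upper())
    return ["".join(codons) for codons in product(*choices)]


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(count_encodings("MVL"))  # 24  (1 x 4 x 6 codon choices)
print(count_encodings("MIL"))  # 18  (Val -> Ile leaves 3 choices at that position)
```

Because the count is a product over positions, it grows very quickly with peptide length, which is why such families are described through the protein sequence rather than written out.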

[176] It is understood that there are numerous amino acid and peptide analogs which
can be incorporated into the disclosed compositions. For example, there are numerous D amino
acids or amino acids which have a different functional substituent than the amino acids shown
in Table 4 and Table 5. The opposite stereo isomers of naturally occurring peptides are
disclosed, as well as the stereo isomers of peptide analogs. These amino acids can readily be
incorporated into polypeptide chains by charging tRNA molecules with the amino acid of choice
and engineering genetic constructs that utilize, for example, amber codons, to insert the analog
amino acid into a peptide chain in a site specific way (Thorson et al., Methods in Molec. Biol.
77:43-73 (1991); Zoller, Current Opinion in Biotechnology, 3:348-354 (1992); Ibba,
Biotechnology & Genetic Engineering Reviews 13:197-216 (1995); Cahill et al., TIBS,
14(10):400-403 (1989); Benner, TIB Tech, 12:158-163 (1994); Ibba and Hennecke,
Bio/technology, 12:678-682 (1994), all of which are herein incorporated by reference at least for
material related to amino acid analogs).

[177] Molecules can be produced that resemble peptides, but which are not connected
via a natural peptide linkage. For example, linkages for amino acids or amino acid analogs can
include -CH2NH-, -CH2S-, -CH2-CH2-, -CH=CH- (cis and trans), -COCH2-,
-CH(OH)CH2-, and -CH2SO-. (These and others can be found in Spatola, A. F. in Chemistry
and Biochemistry of Amino Acids, Peptides, and Proteins, B. Weinstein, eds., Marcel Dekker,
New York, p. 267 (1983); Spatola, A. F., Vega Data (March 1983), Vol. 1, Issue 3, Peptide
Backbone Modifications (general review); Morley, Trends Pharm Sci (1980) pp. 463-468;
Hudson, D. et al., Int J Pept Prot Res 14:177-185 (1979) (-CH2NH-, -CH2CH2-); Spatola et al.
Life Sci 38:1243-1249 (1986) (-CH2-S-); Hann J. Chem. Soc Perkin Trans. I 307-314 (1982)
(-CH=CH-, cis and trans); Almquist et al. J. Med. Chem. 23:1392-1398 (1980) (-COCH2-);
Jennings-White et al. Tetrahedron Lett 23:2533 (1982) (-COCH2-); Szelke et al. European
Appln, EP 45665 CA (1982): 97:39405 (1982) (-CH(OH)CH2-); Holladay et al. Tetrahedron
Lett 24:4401-4404 (1983) (-C(OH)CH2-); and Hruby Life Sci 31:189-199 (1982) (-CH2-S-);
each of which is incorporated herein by reference.) A particularly preferred non-peptide linkage
is -CH2NH-. It is understood that peptide analogs can have more than one atom between the
bond atoms, such as β-alanine, γ-aminobutyric acid, and the like.

[178] Amino acid analogs and peptide analogs often have enhanced or
desirable properties, such as more economical production, greater chemical stability, enhanced
pharmacological properties (half-life, absorption, potency, efficacy, etc.), altered specificity
(e.g., a broad-spectrum of biological activities), reduced antigenicity, and others.

[179] D-amino acids can be used to generate more stable peptides, because D amino
acids are not recognized by peptidases and such. Systematic substitution of one or more amino
acids of a consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of
L-lysine) can be used to generate more stable peptides. Cysteine residues can be used to cyclize
or attach two or more peptides together. This can be beneficial to constrain peptides into
particular conformations. (Rizo and Gierasch Ann. Rev. Biochem. 61:387 (1992), incorporated
herein by reference).

[180] 

9. Pharmaceutical carriers/Delivery of pharmaceutical products

[181] As described above, the compositions can also be administered in vivo in a
pharmaceutically acceptable carrier. By "pharmaceutically acceptable" is meant a material that
is not biologically or otherwise undesirable, i.e., the material can be administered to a subject,
along with the nucleic acid or vector, without causing any undesirable biological effects or
interacting in a deleterious manner with any of the other components of the pharmaceutical
composition in which it is contained. The carrier would naturally be selected to minimize any
degradation of the active ingredient and to minimize any adverse side effects in the subject, as
would be well known to one of skill in the art.

[182] The compositions can be administered orally, parenterally (e.g., intravenously),
by intramuscular injection, by intraperitoneal injection, transdermally, extracorporeally,
topically or the like, including topical intranasal administration or administration by inhalant.
As used herein, "topical intranasal administration" means delivery of the compositions into the
nose and nasal passages through one or both of the nares and can comprise delivery by a
spraying mechanism or droplet mechanism, or through aerosolization of the nucleic acid or
vector. Administration of the compositions by inhalant can be through the nose or mouth via
delivery by a spraying or droplet mechanism. Delivery can also be directly to any area of the
respiratory system (e.g., lungs) via intubation. The exact amount of the compositions required
will vary from subject to subject, depending on the species, age, weight and general condition of
the subject, the severity of the allergic disorder being treated, the particular nucleic acid or
vector used, its mode of administration and the like. Thus, it is not possible to specify an exact
amount for every composition. However, an appropriate amount can be determined by one of
ordinary skill in the art using only routine experimentation given the teachings herein.

[183] Parenteral administration of the composition, if used, is generally characterized
by injection. Injectables can be prepared in conventional forms, either as liquid solutions or
suspensions, solid forms suitable for solution or suspension in liquid prior to injection, or as
emulsions. A more recently revised approach for parenteral administration involves use of a
slow release or sustained release system such that a constant dosage is maintained. See, e.g.,
U.S. Patent No. 3,610,795, which is incorporated by reference herein.

[184] The materials can be in solution, suspension (for example, incorporated into
microparticles, liposomes, or cells). These can be targeted to a particular cell type via
antibodies, receptors, or receptor ligands. The following references are examples of the use of
this technology to target specific proteins to tumor tissue (Senter, et al., Bioconjugate Chem.,
2:447-451, (1991); Bagshawe, K.D., Br. J. Cancer, 60:275-281, (1989); Bagshawe, et al., Br. J.
Cancer, 58:700-703, (1988); Senter, et al., Bioconjugate Chem., 4:3-9, (1993); Battelli, et al.,
Cancer Immunol. Immunother., 35:421-425, (1992); Pietersz and McKenzie, Immunolog.
Reviews, 129:57-80, (1992); and Roffler, et al., Biochem. Pharmacol, 42:2062-2065, (1991)).
Vehicles include "stealth" and other antibody conjugated liposomes (including lipid mediated
drug targeting to colonic carcinoma), receptor mediated targeting of DNA through cell specific
ligands, lymphocyte directed tumor targeting, and highly specific therapeutic retroviral targeting
of murine glioma cells in vivo. The following references are examples of the use of this
technology to target specific proteins to tumor tissue (Hughes et al., Cancer Research, 49:6214-
6220, (1989); and Litzinger and Huang, Biochimica et Biophysica Acta, 1104:179-187, (1992)).
In general, receptors are involved in pathways of endocytosis, either constitutive or ligand
induced. These receptors cluster in clathrin-coated pits, enter the cell via clathrin-coated
vesicles, pass through an acidified endosome in which the receptors are sorted, and then either
recycle to the cell surface, become stored intracellularly, or are degraded in lysosomes. The
internalization pathways serve a variety of functions, such as nutrient uptake, removal of
activated proteins, clearance of macromolecules, opportunistic entry of viruses and toxins,
dissociation and degradation of ligand, and receptor-level regulation. Many receptors follow
more than one intracellular pathway, depending on the cell type, receptor concentration, type of
ligand, ligand valency, and ligand concentration. Molecular and cellular mechanisms of
receptor-mediated endocytosis have been reviewed (Brown and Greene, DNA and Cell Biology
10:6, 399-409 (1991)).

a) Pharmaceutically Acceptable Carriers

[185] The compositions, including antibodies, can be used therapeutically in
combination with a pharmaceutically acceptable carrier.

[186] Suitable carriers and their formulations are described in Remington: The Science
and Practice of Pharmacy (19th ed.) ed. A.R. Gennaro, Mack Publishing Company, Easton,
PA 1995. Typically, an appropriate amount of a pharmaceutically-acceptable salt is used in the
formulation to render the formulation isotonic. Examples of the pharmaceutically-acceptable
carrier include, but are not limited to, saline, Ringer's solution and dextrose solution. The pH of
the solution is preferably from about 5 to about 8, and more preferably from about 7 to about
7.5. Further carriers include sustained release preparations such as semipermeable matrices of
solid hydrophobic polymers containing the antibody, which matrices are in the form of shaped
articles, e.g., films, liposomes or microparticles. It will be apparent to those persons skilled in
the art that certain carriers can be more preferable depending upon, for instance, the route of
administration and concentration of composition being administered.

[187] Pharmaceutical carriers are known to those skilled in the art. These most
typically would be standard carriers for administration of drugs to humans, including solutions
such as sterile water, saline, and buffered solutions at physiological pH. The compositions can
be administered intramuscularly or subcutaneously. Other compounds will be administered
according to standard procedures used by those skilled in the art.

[188] Pharmaceutical compositions can include carriers, thickeners, diluents, buffers,
preservatives, surface active agents and the like in addition to the molecule of choice.
Pharmaceutical compositions can also include one or more active ingredients such as antimicrobial
agents, antiinflammatory agents, anesthetics, and the like.

[189] The pharmaceutical composition can be administered in a number of ways
depending on whether local or systemic treatment is desired, and on the area to be treated.
Administration can be topically (including ophthalmically, vaginally, rectally, intranasally), orally,
by inhalation, or parenterally, for example by intravenous drip, subcutaneous, intraperitoneal or
intramuscular injection. The disclosed antibodies can be administered intravenously,
intraperitoneally, intramuscularly, subcutaneously, intracavity, or transdermally.

[190] Preparations for parenteral administration include sterile aqueous or non-aqueous
solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol,
polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl
oleate. Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or suspensions,
including saline and buffered media. Parenteral vehicles include sodium chloride solution,
Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's, or fixed oils. Intravenous
vehicles include fluid and nutrient replenishers, electrolyte replenishers (such as those based on
Ringer's dextrose), and the like. Preservatives and other additives can also be present such as,
for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the like.

[191] Formulations for topical administration can include ointments, lotions, creams,
gels, drops, suppositories, sprays, liquids and powders. Conventional pharmaceutical carriers,
aqueous, powder or oily bases, thickeners and the like may be necessary or desirable.

[192] Compositions for oral administration include powders or granules, suspensions or
solutions in water or non-aqueous media, capsules, sachets, or tablets. Thickeners, flavorings,
diluents, emulsifiers, dispersing aids or binders may be desirable.

[193] Some of the compositions can potentially be administered as a pharmaceutically
acceptable acid- or base- addition salt, formed by reaction with inorganic acids such as
hydrochloric acid, hydrobromic acid, perchloric acid, nitric acid, thiocyanic acid, sulfuric acid,
and phosphoric acid, and organic acids such as formic acid, acetic acid, propionic acid, glycolic
acid, lactic acid, pyruvic acid, oxalic acid, malonic acid, succinic acid, maleic acid, and fumaric
acid, or by reaction with an inorganic base such as sodium hydroxide, ammonium hydroxide,
potassium hydroxide, and organic bases such as mono-, di-, trialkyl and aryl amines and
substituted ethanolamines.

10. Chips and micro arrays

[194] Disclosed are chips where at least one address is the sequences or part of the
sequences set forth in any of the nucleic acid sequences disclosed herein. Also disclosed are
chips where at least one address is the sequences or portion of sequences set forth in any of the
peptide sequences disclosed herein.

[195] Also disclosed are chips where at least one address is a variant of the sequences
or part of the sequences set forth in any of the nucleic acid sequences disclosed herein. Also
disclosed are chips where at least one address is a variant of the sequences or portion of
sequences set forth in any of the peptide sequences disclosed herein.

11. Computer readable mediums 

[196] It is understood that the disclosed nucleic acids and proteins can be represented
as a sequence consisting of the nucleotides or amino acids. There are a variety of ways to
display these sequences, for example the nucleotide guanosine can be represented by G or g.
Likewise the amino acid valine can be represented by Val or V. Those of skill in the art
understand how to display and express any nucleic acid or protein sequence in any of the variety
of ways that exist, each of which is considered herein disclosed. Specifically contemplated
herein is the display of these sequences on computer readable mediums, such as commercially
available floppy disks, tapes, chips, hard drives, compact disks, and video disks, or other
computer readable mediums. Also disclosed are the binary code representations of the disclosed
sequences. Those of skill in the art understand what computer readable mediums are. Thus,
disclosed are computer readable mediums on which the nucleic acids or protein sequences are
recorded, stored, or saved.
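
One possible binary code representation of a nucleotide sequence is sketched below in Python. It is offered only as an illustration: the two-bits-per-base mapping, the 4-byte length prefix, and the function names `pack` and `unpack` are assumptions made for this example and are not a format defined in this disclosure. Upper- and lower-case letters (G or g) are treated as the same nucleotide.

```python
# Assumed mapping for this sketch: two bits per nucleotide.
TWO_BIT = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASE_FOR = {bits: base for base, bits in TWO_BIT.items()}


def pack(seq: str) -> bytes:
    """Pack a DNA string into bytes: 4-byte big-endian length, then 4 bases per byte."""
    seq = seq.upper()
    out = bytearray(len(seq).to_bytes(4, "big"))
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | TWO_BIT[base]
        byte <<= 2 * (4 - len(chunk))  # left-align a short final chunk
        out.append(byte)
    return bytes(out)


def unpack(data: bytes) -> str:
    """Inverse of pack(); the length prefix tells us how many bases to keep."""
    n = int.from_bytes(data[:4], "big")
    bases = []
    for byte in data[4:]:
        for shift in (6, 4, 2, 0):
            bases.append(BASE_FOR[(byte >> shift) & 0b11])
    return "".join(bases[:n])


packed = pack("gattaca")
print(packed.hex())
print(unpack(packed))  # GATTACA
```

The length prefix is what allows sequences whose length is not a multiple of four to be recovered exactly, since the padding bits in the last byte would otherwise decode as extra A residues.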

[197] Disclosed are computer readable mediums comprising the sequences and
information regarding the sequences set forth herein.

12. Kits

[198] Disclosed herein are kits that are drawn to reagents that can be used in practicing
the methods disclosed herein. The kits can include any reagent or combination of reagents
discussed herein or that would be understood to be required or beneficial in the practice of the
disclosed methods. For example, the kits could include primers to perform the amplification
reactions discussed in certain embodiments of the methods, as well as the buffers and enzymes
required to use the primers as intended.

D. Methods of making the compositions

[199] The compositions disclosed herein and the compositions necessary to perform
the disclosed methods can be made using any method known to those of skill in the art for that
particular reagent or compound unless otherwise specifically noted.

[200] The disclosed viral vectors can be made using standard recombinant molecular
biology techniques. Many of these techniques are illustrated in Maniatis (Maniatis et al.,
"Molecular Cloning: A Laboratory Manual," Cold Spring Harbor Laboratory, Latest edition)
and Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor, New York, 1989.

1. Nucleic acid synthesis 

[201] For example, the nucleic acids, such as the oligonucleotides to be used as
primers, can be made using standard chemical synthesis methods or can be produced using
enzymatic methods or any other known method. Such methods can range from standard
enzymatic digestion followed by nucleotide fragment isolation (see for example, Sambrook et
al., Molecular Cloning: A Laboratory Manual, 2nd Edition (Cold Spring Harbor Laboratory
Press, Cold Spring Harbor, N.Y., 1989) Chapters 5, 6) to purely synthetic methods, for example,
by the cyanoethyl phosphoramidite method using a Milligen or Beckman System Plus DNA
synthesizer (for example, Model 8700 automated synthesizer of Milligen-Biosearch, Burlington,
MA or ABI Model 380B). Synthetic methods useful for making oligonucleotides are also
described by Ikuta et al., Ann. Rev. Biochem. 53:323-356 (1984), (phosphotriester and
phosphite-triester methods), and Narang et al., Methods Enzymol., 65:610-620 (1980),
(phosphotriester method). Protein nucleic acid molecules can be made using known methods
such as those described by Nielsen et al., Bioconjug. Chem. 5:3-7 (1994).

2. Peptide synthesis 

[202] One method of producing the disclosed proteins is to link two or more peptides
or polypeptides together by protein chemistry techniques. For example, peptides or
polypeptides can be chemically synthesized using currently available laboratory equipment
using either Fmoc (9-fluorenylmethyloxycarbonyl) or Boc (tert-butyloxycarbonyl) chemistry
(Applied Biosystems, Inc., Foster City, CA). One skilled in the art can readily appreciate that a
peptide or polypeptide corresponding to the disclosed proteins, for example, can be synthesized
by standard chemical reactions. For example, a peptide or polypeptide can be synthesized and
not cleaved from its synthesis resin whereas the other fragment of a peptide or protein can be
synthesized and subsequently cleaved from the resin, thereby exposing a terminal group which
is functionally blocked on the other fragment. By peptide condensation reactions, these two
fragments can be covalently joined via a peptide bond at their carboxyl and amino termini,
respectively, to form an antibody, or fragment thereof (Grant GA (1992) Synthetic Peptides: A
User Guide. W.H. Freeman and Co., N.Y. (1992); Bodansky M and Trost B., Ed. (1993)
Principles of Peptide Synthesis. Springer-Verlag Inc., NY (which is herein incorporated by
reference at least for material related to peptide synthesis)). Alternatively, the peptide or
polypeptide is independently synthesized in vivo as described herein. Once isolated, these
independent peptides or polypeptides can be linked to form a peptide or fragment thereof via
similar peptide condensation reactions.

[203] For example, enzymatic ligation of cloned or synthetic peptide segments allows
relatively short peptide fragments to be joined to produce larger peptide fragments, polypeptides
or whole protein domains (Abrahmsen L et al., Biochemistry, 30:4151 (1991)). Alternatively,
native chemical ligation of synthetic peptides can be utilized to synthetically construct large
peptides or polypeptides from shorter peptide fragments. This method consists of a two-step
chemical reaction (Dawson et al. Synthesis of Proteins by Native Chemical Ligation. Science,
266:776-779 (1994)). The first step is the chemoselective reaction of an unprotected synthetic
peptide-thioester with another unprotected peptide segment containing an amino-terminal Cys
residue to give a thioester-linked intermediate as the initial covalent product. Without a change
in the reaction conditions, this intermediate undergoes spontaneous, rapid intramolecular
reaction to form a native peptide bond at the ligation site (Baggiolini M et al. (1992) FEBS Lett.
307:97-101; Clark-Lewis I et al., J. Biol. Chem., 269:16075 (1994); Clark-Lewis I et al.,
Biochemistry, 30:3128 (1991); Rajarathnam K et al., Biochemistry 33:6623-30 (1994)).

[204] Alternatively, unprotected peptide segments are chemically linked where the
bond formed between the peptide segments as a result of the chemical ligation is an unnatural
(non-peptide) bond (Schnolzer, M et al. Science, 256:221 (1992)). This technique has been
used to synthesize analogs of protein domains as well as large amounts of relatively pure
proteins with full biological activity (deLisle Milton RC et al., Techniques in Protein Chemistry
IV. Academic Press, New York, pp. 257-267 (1992)).

3. Processes for making the compositions 

[205] Disclosed are processes for making the compositions as well as making the
intermediates leading to the compositions. There are a variety of methods that can be used for
making these compositions, such as synthetic chemical methods and standard molecular biology
methods. It is understood that the methods of making these and the other disclosed
compositions are specifically disclosed.

[206] Disclosed are nucleic acid molecules produced by the process comprising linking
in an operative way a promoter element and a µ-opioid receptor element.

[207] Disclosed are nucleic acid molecules produced by the process comprising linking
in an operative way nucleic acid molecules comprising sequences set forth in SEQ ID NO:2 and
SEQ ID NO:4.

[208] Also disclosed are nucleic acid molecules produced by the process comprising
linking in an operative way nucleic acid molecules comprising sequences having 80% identity
to sequences set forth in SEQ ID NO:2 and SEQ ID NO:4.
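
As a rough illustration of what an 80% identity threshold means, the following Python sketch scores two sequences that are assumed to be already aligned to equal length. The function names, the gap character "-", and the toy sequences are hypothetical; they are not the identity calculation defined elsewhere in this specification, and real comparisons would first align the sequences with an established alignment method.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of aligned positions at which two pre-aligned sequences match.

    Assumption: both strings have the same length and use '-' for gaps.
    Gap positions never count as matches.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for x, y in zip(a, b) if x == y and x != "-")
    return 100.0 * matches / len(a)


def meets_threshold(a: str, b: str, threshold: float = 80.0) -> bool:
    """True when the aligned sequences reach the given percent identity."""
    return percent_identity(a, b) >= threshold


# Toy sequences (hypothetical, not SEQ ID NO:2 or SEQ ID NO:4).
reference = "ACGTACGTAC"
candidate = "ACGTACGTTT"
identity = percent_identity(reference, candidate)
```

Dividing by the full alignment length is one convention among several; dividing by the shorter sequence length or excluding gap columns would give different percentages.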

[209] Also disclosed are nucleic acid molecules produced by the process comprising
linking in an operative way nucleic acid molecules comprising sequences that hybridize under
stringent hybridization conditions to sequences set forth in SEQ ID NO:2 and SEQ ID NO:4.

[210] Disclosed are nucleic acid molecules produced by the process comprising linking
in an operative way a nucleic acid molecule comprising a sequence encoding a µ-opioid receptor
peptide and a sequence controlling an expression of the sequence encoding the µ-opioid receptor
peptide.

[211] Disclosed are nucleic acid molecules produced by the process comprising linking
in an operative way a nucleic acid molecule comprising a sequence encoding a µ-opioid receptor
peptide wherein the µ-opioid receptor peptide has 80% identity to the peptide set forth in SEQ
ID NO: 1 or 3 and a sequence controlling expression of the sequences encoding the peptide.

[212] Disclosed are nucleic acid molecules produced by the process comprising linking
in an operative way a nucleic acid molecule comprising a sequence encoding a µ-opioid receptor
peptide wherein the µ-opioid receptor peptide has 80% identity to the peptides set forth in SEQ
ID NO: 1 or 3 and wherein any changes from the sequences set forth in SEQ ID NO: 1 or 3 are
conservative changes, and a sequence controlling expression of the sequences encoding the
peptide.

[213] Disclosed are cells produced by the process of transforming the cell with any of
the disclosed nucleic acids. Disclosed are cells produced by the process of transforming the cell
with any of the non-naturally occurring disclosed nucleic acids.

[214] Disclosed are any of the disclosed peptides produced by the process of
expressing any of the disclosed nucleic acids. Disclosed are any of the non-naturally occurring
disclosed peptides produced by the process of expressing any of the disclosed nucleic acids.
Disclosed are any of the disclosed peptides produced by the process of expressing any of the
non-naturally occurring disclosed nucleic acids.

[215] Disclosed are animals produced by the process of transfecting a cell within the
animal with any of the nucleic acid molecules disclosed herein. Disclosed are animals produced
by the process of transfecting a cell within the animal with any of the nucleic acid molecules
disclosed herein, wherein the animal is a mammal. Also disclosed are animals produced by the
process of transfecting a cell within the animal with any of the nucleic acid molecules disclosed
herein, wherein the mammal is mouse, rat, rabbit, cow, sheep, pig, or primate. Also disclosed
are non-human primates and non-human mammals.

[216] Also disclosed are animals produced by the process of adding to the animal any of
the cells disclosed herein.

E. Methods of using the compositions

1. Methods of using the compositions as research tools 

[217] The disclosed compositions can be used in a variety of ways as research tools.
For example, the disclosed compositions, the µ-opioid receptor constructs, and other nucleic
acids, such as SEQ ID NOs:2 and 4, can be used to produce organisms, such as transgenic or
knockout mice, which can be used as model systems for the study of pain.

2. Methods of gene modification and gene disruption

[218] The disclosed compositions and methods can be used for targeted gene
disruption and modification in any animal that can undergo these events. Gene modification
and gene disruption refer to the methods, techniques, and compositions that surround the
selective removal or alteration of a gene or stretch of chromosome in an animal, such as a
mammal, in a way that propagates the modification through the germ line of the mammal. In
general, a cell is transformed with a vector which is designed to homologously recombine with a
region of a particular chromosome contained within the cell, as for example, described herein.
This homologous recombination event can produce a chromosome which has exogenous DNA
introduced, for example in frame, with the surrounding DNA. This type of protocol allows for
very specific mutations, such as point mutations, to be introduced into the genome contained
within the cell. Methods for performing this type of homologous recombination are disclosed
herein.

[219] One of the preferred characteristics of performing homologous recombination in
mammalian cells is that the cells should be able to be cultured, because the desired
recombination event occurs at a low frequency.

[220] Once the cell is produced through the methods described herein, an animal can
be produced from this cell through either stem cell technology or cloning technology. For
example, if the cell into which the nucleic acid was transfected was a stem cell for the organism,
then this cell, after transfection and culturing, can be used to produce an organism which will
contain the gene modification or disruption in germ line cells, which can then in turn be used to
produce another animal that possesses the gene modification or disruption in all of its cells. In
other methods for production of an animal containing the gene modification or disruption in all
of its cells, cloning technologies can be used. These technologies generally take the nucleus of
the transfected cell and either through fusion or replacement fuse the transfected nucleus with an
oocyte which can then be manipulated to produce an animal. The advantage of procedures that
use cloning instead of ES technology is that cells other than ES cells can be transfected. For
example, a fibroblast cell, which is very easy to culture, can be used as the cell which is
transfected and has a gene modification or disruption event take place, and then cells derived
from this cell can be used to clone a whole animal.

3. Therapeutic Uses

[221] Effective dosages and schedules for administering the compositions can be
determined empirically, and making such determinations is within the skill in the art. The
dosage ranges for the administration of the compositions are those large enough to produce the
desired effect in which the symptoms of the disorder are affected. The dosage should not be so
large as to cause adverse side effects, such as unwanted cross-reactions, anaphylactic reactions,
and the like. Generally, the dosage will vary with the age, condition, sex and extent of the
disease in the patient, route of administration, or whether other drugs are included in the
regimen, and can be determined by one of skill in the art. The dosage can be adjusted by the
individual physician in the event of any contraindications. Dosage can vary, and can be
administered in one or more dose administrations daily, for one or several days. Guidance can
be found in the literature for appropriate dosages for given classes of pharmaceutical products.

[222] Following administration of a disclosed composition, such as the disclosed
constructs, for treating, inhibiting, or preventing pain, the efficacy of the therapeutic construct
can be assessed in various ways well known to the skilled practitioner. For instance, one of
ordinary skill in the art will understand that a composition, such as the disclosed constructs,
disclosed herein is efficacious in treating pain or inhibiting or reducing the effects of pain in a
subject by observing that the composition reduces the onset of the conditions associated with
these diseases. Furthermore, the amount of protein or transcript produced from the constructs
can be analyzed using any diagnostic method. For example, it can be measured using
polymerase chain reaction assays to detect the presence of construct nucleic acid or antibody
assays to detect the presence of protein produced from the construct in a sample (e.g., but not
limited to, blood or other cells, such as neural cells) from a subject or patient, or it can be
measured by any of the methods disclosed herein for monitoring non-human pain, and through
communication for human pain.

[223] 

F. Examples 

[224] It will be apparent to those skilled in the art that various modifications and
variations can be made in the present invention without departing from the scope or spirit of the
invention. Other embodiments of the invention will be apparent to those skilled in the art from
consideration of the specification and practice of the invention disclosed herein. It is intended
that the specification and examples be considered as exemplary only, with a true scope and
spirit of the invention being indicated by the following claims.

[225] The following examples are put forth so as to provide those of ordinary skill in
the art with a complete disclosure and description of how the compounds, compositions,
articles, devices and/or methods claimed herein are made and evaluated, and are intended to be
purely exemplary of the invention and are not intended to limit the scope of what the inventors
regard as their invention. Efforts have been made to ensure accuracy with respect to numbers
(e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for.
Unless indicated otherwise, parts are parts by weight, temperature is in °C or is at ambient
temperature, and pressure is at or near atmospheric.

1. Example 1 Orofacial pain 

a) Methods and approaches

[226] Expression of the µ-opioid receptor in neurons involved in the central processing
of orofacial pain will result in attenuation of nociception. FIV(µ-opioid receptor), a lentivirus
capable of stably transducing terminally differentiated cells (neurons) with µ-opioid receptor,
can be peripherally administered at orofacial sites, including the TMJ and the masseter muscle,
exposed to nociceptive substances. The efficacy of the therapy can be assessed by assaying
change in muscle EMG as well as resistance to jaw opening.

(1) Construct FIV(HUMOR) and confirm its ability to
transduce cells of neuronal origin

[227] The open reading frame of the HUMOR cDNA having the sequence set forth
in SEQ ID NO:2 (Raynor K, et al., J Pharmacol Exp Ther 1995; 272:423-8) was cloned into
the pFIV vector using standard molecular biology methods, generating pFIV(HUMOR), SEQ ID
NO: 7 (see Figure 4). Constructs having two different promoters were made to drive the
expression of HUMOR: the neuron specific enolase (NSE) promoter as well as the
cytomegalovirus (CMV) promoter. The NSE promoter limits the expression of HUMOR
selectively to neurons at constitutive levels, whereas the CMV promoter results in ubiquitous
levels of expression independent of cell type. Clones were selected following DNA isolation
and confirmation by multiple restriction digestions and direct DNA sequencing of both strands.
pFIV(HUMOR) can be co-transfected with pPAK and pVSV-G vectors in 293H cells utilizing
the Lipofectamine 2000® reagent (Gibco/BRL-Invitrogen) for virus production as previously
described (Figure 3). Following 60 hours incubation, the supernatant will be collected, filtered
and titered. Titers of 10^ ip/ml are typical using this method. If required, the titers can be
increased by ultra-centrifugation of the supernatant and re-suspension in normal saline.

[228] FIV(NSE-HUMOR) as well as FIV(CMV-HUMOR) efficacy can be tested on
the N2a neuronal cell line, whereby cells can be seeded on 12 well plates and infected by
FIV(HUMOR), followed by a 24 hour fresh media change. DNA, RNA and protein samples
can be harvested using standard laboratory methods at 72 hours post infection. The presence of
HUMOR gene copies in the infected samples can be assessed by Q-PCR, and the expression of
HUMOR can be determined at the mRNA and protein levels by RT-PCR and western
immunoblotting, respectively. In addition, other cells can be fixed by 4% paraformaldehyde for
HUMOR protein expression detection and visualization by immunocytochemistry. Figure 1 or
2 demonstrates the expression of HUMOR in N2a cells following transient transfection and
detection of its expression by immunocytochemistry utilizing a commercially available antibody
raised against HUMOR (Research & Diagnostic Antibodies, Benicia CA Cat# AS-3942S). The
FIV vector can infect and stably transduce sensory trigeminal neurons following local peripheral
administration as demonstrated by the data in Figure 3. At the animal behavioral level, the
targeted expression of HUMOR in orofacial sensory neurons can result in attenuation of
nociceptive symptoms that readily can be measured as change (decrease) in EMG activity and
decrease in resistance to jaw opening.

(2) Targeted expression of µ-opioid receptor in orofacial
somatosensory neurons can attenuate orofacial pain following
administration of algesic substance

[229] The feline immunodeficiency viral vectors FIV(NSE-HUMOR) and FIV(CMV-HUMOR)
can be employed in the transduction of trigeminal sensory neurons with the µ-opioid
receptor. Orofacial nociception can be quantitatively assessed by means of masticatory muscle
activity, as measured by electromyography (EMG), as well as resistance to mouth opening, as
measured by a digital force transducer. Table 6 summarizes the experimental conditions. In
brief, FIV(NSE-HUMOR) and FIV(CMV-HUMOR) can be injected (50 µl of 10^ ip/ml) in the
TMJ or masseter muscle of anesthetized animals, which can be returned to their cages after
recovery. Five weeks post-treatment, EMG bipolar percutaneous hook electrodes can be
inserted in the masseter and temporal muscles under anesthesia and an orthodontic Kobayashi
hook can be bonded to the mandibular incisors for attachment to the digital force gauge. The
animals can then be positioned in a custom-made restraining device. Baseline EMG and
resistance to jaw opening measurements can be taken for every animal. Four channels of
simultaneous EMG signal (right and left masseter and temporal muscles) and one channel for
the digital force gauge can be recorded. An A/D conversion card (NI016E1, National
Instruments) for the EMG signal and a digital force gauge (FGF series, Kemco Instruments) can
be used for signal collection. A "blinded" observer can collect 10 seconds of EMG and digital
force measurements at 5, 10 and 15 mm of vertical opening five times simultaneously. Data can
be stored for later analysis. The appropriate animals can then be injected in the
temporomandibular joint and masseter muscle area with the algesic agent glutamate (2.5 pmol
in saline) or vehicle solution (saline), and EMG and resistance to jaw opening recordings can be
taken. Data can be analyzed by two-way ANOVA with repeated measures. Student's t-test can
compare the two groups. These experiments replicate conditions seen in humans.
(Molin C, Acta Odontol Scand 1972; 30:485-99; Moller UM, et al., Scand J Dent Res 1984; 92:
64-83; Stohler CS, et al., Helv Odontol Acta 1985; 29:13-20; Stohler CS, et al., Arch Oral Biol
1988; 33:175-82; Stohler CS. Temporomandibular disorders and related pain conditions. In:
Sessle, B.J., Bryant, P.S. and Dionne, R.A. Editors, 1995. Progress in pain research and
management. Elsevier, Amsterdam, pp. 3-30; Lund JP, Stohler CS. Effect of pain on muscular
activity in temporomandibular disorders and related conditions. In: Stohler, C.S., Carlson, D.S.,
editors. Biological and Psychological aspects of orofacial pain, craniomandibular growth series.
Ann Arbor, MI: University of Michigan, 1994. pp. 75-91; Hoshio T. Senior Thesis, Division
of Orthodontics, Eastman Department of Dentistry, University of Rochester School of Medicine
and Dentistry. Rochester, New York; Balkhi KM, et al., J Orofacial Pain 1993;7:89-97).
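
The two-group comparison named in paragraph [229] can be sketched with the pooled-variance Student's t statistic. This Python sketch uses only the standard library; the function name and the sample values are hypothetical placeholders, not measurements from this disclosure, and the repeated-measures ANOVA is not shown.

```python
from math import sqrt
from statistics import mean, variance


def student_t(x, y):
    """Pooled-variance (equal-variance) two-sample Student's t statistic.

    Returns (t, degrees_of_freedom). Each sample needs at least two values.
    """
    nx, ny = len(x), len(y)
    if nx < 2 or ny < 2:
        raise ValueError("each group needs at least two observations")
    df = nx + ny - 2
    # Pooled estimate of the common variance, weighted by each group's df.
    pooled = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / df
    standard_error = sqrt(pooled * (1.0 / nx + 1.0 / ny))
    return (mean(x) - mean(y)) / standard_error, df


# Placeholder readings for two hypothetical groups (arbitrary units).
group_a = [1.0, 2.0, 3.0, 4.0, 5.0]
group_b = [2.0, 3.0, 4.0, 5.0, 6.0]
t_value, dof = student_t(group_a, group_b)
```

The resulting statistic would then be compared against the t distribution with the returned degrees of freedom to obtain a p-value.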

[230] HUMOR expression can be confirmed following the experimental recordings in
deeply anesthetized animals that can be sacrificed via transcardial perfusion of 4%
paraformaldehyde, and the brain stem, trigeminal ganglia, masseter muscle and TMJ can be
excised and collected for further analysis. Specifically, we can characterize the expression of
HUMOR in the central and peripheral tissues utilizing immunocytochemistry as depicted by our
preliminary data. Cell type specificity can be established by double immunofluorescence, as
previously demonstrated by our laboratory. Potential deleterious effects of HUMOR expression
(or over-expression in the CMV driven gene) to neurons or other cells can be examined by
Hoechst staining and confirmed by TUNEL (in the case of cell death) as well as by the
expression of neuron specific housekeeping genes, such as neuron specific enolase, as well as
substance P (expressed selectively by sensory neurons).



TABLE 6. Summary of experimental conditions

ALGESIC AGENT    SITE OF NOCICEPTION        TREATMENT                  N

Glutamate        Temporomandibular joint    FIV(NSE-HUMOR)             10 each
Saline           Masseter muscle            FIV(CMV-HUMOR)
                                            morphine
                                            FIV(NSE-HUMOR)+opioid
                                            FIV(CMV-HUMOR)+opioid
                                            saline
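
The conditions in Table 6 can be enumerated with a short Python sketch. It assumes the table is read as a fully crossed design with 10 animals in every combination; the table itself does not state how the columns combine, so the group count and animal total below follow from that assumption only.

```python
from itertools import product

# Column entries copied from Table 6.
ALGESIC_AGENTS = ["glutamate", "saline"]
SITES = ["temporomandibular joint", "masseter muscle"]
TREATMENTS = [
    "FIV(NSE-HUMOR)",
    "FIV(CMV-HUMOR)",
    "morphine",
    "FIV(NSE-HUMOR)+opioid",
    "FIV(CMV-HUMOR)+opioid",
    "saline",
]
N_PER_GROUP = 10  # "10 each" in the table


def enumerate_groups():
    """All (agent, site, treatment) combinations under the full-crossing assumption."""
    return list(product(ALGESIC_AGENTS, SITES, TREATMENTS))


groups = enumerate_groups()
total_animals = len(groups) * N_PER_GROUP

for agent, site, treatment in groups:
    print(f"{agent:10s} | {site:24s} | {treatment:22s} | n={N_PER_GROUP}")
```

If some combinations are not intended (for example, saline as both algesic agent and treatment serving only as a control), the corresponding tuples would simply be filtered out of the list.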



[231] Patients presenting with orofacial pain from TMJ disorders have been
characterized as having decreased bite and chewing forces and limited jaw opening. For
example, reduced grip strength (Norsdenskiold UM and Grimby G., Scand J Rheumatol 1993;
22:14-9), sustained jaw closing pain (Jow RW and Clark GT., Arch Oral Biol 1989; 34:857-62)
and reduced bite strength (Molin C, Acta Odontol Scand 1972; 30:485-99) have all been
reported in patients with muscle pain. The reduction in muscle force exertion associated with
myalgia has been suggested to be due to reduced activity of agonist muscles and increased
activity of antagonist muscles (Molin C, Acta Odontol Scand 1972; 30:485-99; Stohler CS, et
al., Helv Odontol Acta 1985; 29:13-20; Stohler CS, et al., Arch Oral Biol 1988; 33:175-82;
Stohler CS. Temporomandibular disorders and related pain conditions. In: Sessle, B.J., Bryant,
P.S. and Dionne, R.A. Editors, 1995. Progress in pain research and management. Elsevier,
Amsterdam, pp. 3-30; and Lund JP, Stohler CS. Effect of pain on muscular activity in
temporomandibular disorders and related conditions. In: Stohler, C.S., Carlson, D.S., editors.
Biological and Psychological aspects of orofacial pain, craniomandibular growth series. Ann
Arbor, MI: University of Michigan, 1994. pp. 75-91). Keil et al. (Keil LJ, et al., Pain 2000;
85:333-43) have demonstrated that forelimb grip force reduction is a behavioral index of
hyperalgesia in the carrageenan model of muscle hyperalgesia. This would translate to
reduction of bite force and an increase in antagonist muscle activity in the orofacial region.
These experiments have been done in humans (Lund JP, Stohler CS. Effect of pain on
muscular activity in temporomandibular disorders and related conditions. In: Stohler, C.S.,
Carlson, D.S., editors. Biological and Psychological aspects of orofacial pain, craniomandibular
growth series. Ann Arbor, MI: University of Michigan, 1994. pp. 75-91; Hoshio T. Senior
Thesis, Division of Orthodontics, Eastman Department of Dentistry, University of Rochester
School of Medicine and Dentistry. Rochester, New York; and Balkhi KM, et al., J Orofacial
Pain 1993;7:89-97). Hoshio has also demonstrated that compared to controls, patients with
TMD demonstrate decreased bite (201 and 223 mV for the masseter muscles respectively in
asymptomatic volunteers and 128 and 153 mV for symptomatic patients). Balkhi (Balkhi KM,
et al., J Orofacial Pain 1993;7:89-97) demonstrated that chewing force was decreased in patients
with pain (113 and 102 mV for deliberate right and left side chewing of gum, masseter muscles
respectively, in asymptomatic volunteers and 85 and 83 mV for symptomatic patients). It has
been demonstrated that there was increased jaw muscle activity of antagonists during jaw
opening. These data clearly demonstrate that patients with TMD have similar characteristics
and that decreased bite force and chewing activity are a reflection of somatic pain.

[232] Mice (C57/B6) can be employed to make the transgenic mice. 

a) Injection of replication defective FIV(HUMOR) vectors

[233] The animals, such as pups, can be anesthetized with ketamine (60 mg/kg) and
xylazine (5 mg/kg) IP. To verify the induction of surgical anesthesia, a toe is pinched in order
to test for reflex withdrawal. One type of injection will be performed in two distinct areas.
Under surgical plane of anesthesia the animals, such as mice, can be injected with 50 µl of 10^
ip/ml of FIV(HUMOR) using a 1 ml syringe with a 27 1/2 gauge needle directly into the
temporomandibular joint and masseter muscle. The animals can be identified by ear punching.
Animals can be held at the base of the tail with the distal portion of the tail situated on the
surface of a nestlet, for example. Using a straight edge blade, ~7 mm of distal tail can be
removed, and the mouse can be placed in a cage and the tail specimen stored in a vial labeled by
mouse ID# and sex. The mice are euthanized with sodium pentobarbital (200 mg/kg).
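
The weight-based doses in paragraph [233] convert to per-animal amounts by simple proportion. In the Python sketch below, the 25 g body mass and the function name are hypothetical illustration values; only the mg/kg figures come from the text.

```python
def dose_mg(mg_per_kg: float, body_mass_g: float) -> float:
    """Amount of drug in mg for an animal of the given body mass in grams."""
    if body_mass_g <= 0:
        raise ValueError("body mass must be positive")
    return mg_per_kg * body_mass_g / 1000.0


# mg/kg values stated in paragraph [233].
DOSES_MG_PER_KG = {
    "ketamine": 60.0,
    "xylazine": 5.0,
    "sodium pentobarbital": 200.0,
}

EXAMPLE_MASS_G = 25.0  # hypothetical mouse body mass, not from the source

per_animal = {
    drug: dose_mg(rate, EXAMPLE_MASS_G) for drug, rate in DOSES_MG_PER_KG.items()
}
```

For the hypothetical 25 g animal this gives 1.5 mg ketamine, 0.125 mg xylazine and 5 mg sodium pentobarbital; the injected volume would further depend on the stock concentration, which the text does not specify.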
[234] Fixation by intracardial transfusion can be performed. Upon exposure of the
heart, the right atrium can be clipped and the left ventricle can be catheterized with a 17 gauge
needle through which 50 ml of 4% paraformaldehyde solution in phosphate buffered saline can
be transfused into the animal. The liver, spleen, kidney and brain can be dissected and
post-fixed until sectioned for histology. The middle part of the cranium, including the cranial
base (sphenoid, ethmoid, maxilla), can also be dissected, demineralized by immersion into an
EDTA solution and sectioned for histology.

2. Example 2: Non-Primate Lentiviral Vector Administration in the TMJ

[235] Disclosed herein are the effects of lentiviral vectors on the temporomandibular joint.
Defective feline immunodeficiency virus capable of infecting dividing as well as terminally
differentiated cells with the reporter gene lacZ, the expression of which was studied by means of
PCR, X-gal histochemistry and β-galactosidase immunocytochemistry, was injected into the
articular joint space. The results showed successful transduction of hard and soft tissues of the
temporomandibular joint. Interestingly, a subset of primary sensory neurons of the ipsilateral
trigeminal ganglion also stained positive for the reporter gene, presumably following uptake
of the lentiviral vector by peripheral nerve fibers and retrograde transport to the nucleus. These
findings indicate that anti-nociceptive genes, such as the opioid receptor genes disclosed
herein, can be transferred into nerve cells and relieve pain. For example, lentiviral
vectors can serve as the platform for the transfer of anti-nociceptive genes for the management
of temporomandibular joint pain.

a) Materials and Methods 

(1) Vector Construction and Packaging 

[236] The defective, vesicular stomatitis (VSV-G) pseudotyped, feline
immunodeficiency virus, FIV(lacZ), capable of transducing dividing, growth-arrested as well as
post-mitotic cells (neurons) with the reporter gene lacZ driven by the ubiquitous
cytomegalovirus promoter, CMV (Poeschla EM, et al. (1998). Nature Med 4: 354-357), was
employed. The vectors were kindly donated to us by Dr. Wong-Staal, University of California at
San Diego. A schematic description of the vector is depicted in Figure 6A. In addition, a
control FIV(Δ'lac) vector carrying an inactive β-galactosidase was constructed by deleting the
first 1,000 bp of the lacZ gene (3.75 kb in total), including the transcription initiation site (Fig.
6A). Specifically, the FIV(lacZ) vector was digested in vitro with the SstII and ClaI restriction
enzymes overnight at 37°C, followed by agarose gel purification. The ends of the backbone
DNA were blunted with the T4 DNA polymerase (Invitrogen, Carlsbad CA) and ligated with T4
ligase (Invitrogen) according to manufacturer's instructions. The FIV(lacZ) and FIV(Δ'lac)
vectors were transiently co-transfected along with the packaging and VSV-G vectors into 293H
cells (GIBCO/BRL) cultured in DMEM (Invitrogen) plus 10% FBS (Gemini, Woodland CA)
using the Lipofectamine 2000 reagent per manufacturer's instructions (Invitrogen), followed
by a fresh media change supplemented by non-essential amino acids (Invitrogen). Sixty hours
post-transfection, the supernatant was collected, filtered through a 0.45 µm Surfil®-MF filter
(Corning Separations Division, Acton MA), aliquoted and frozen until further use. Titering was
performed on CrfK cells (American Type Culture Collection; Manassas, VA) cultured in 24
well tissue culture plates, and assessed at 5x10^ blue forming units (bfu)/mL by X-gal
histochemistry.

(2) Animal Injections 

[237] All methods pertinent to animal utilization were approved by the University
Committee on Animal Resources. Specifically, 12 male mice, C57BL/6J, under surgical plane
of anesthesia (ketamine 60 mg/kg and xylazine 5 mg/kg administered intraperitoneally)
received a single injection of 5x10^ FIV(lacZ) infectious particles (100 µl of stock solution) in
the joint space of the right TMJ. Four additional mice received a single injection of 5x10^
FIV(Δ'lac) infectious particles (100 µl of stock solution) in the joint space of the right TMJ. In
brief, the hair of the skin covering the right TMJ was shaved and the skin cleaned with Betadine
solution. The joint was approached with an antero-posterior incision between the posterior end
of the zygomatic arch and the ear cartilage, followed by a blunt dissection to expose the
zygomatic arch and the posterior margin of the articular eminence. The joint space was not
exposed during this procedure. The posterior margin of the eminence was identified by
palpation and a 1-ml tuberculin syringe with a 27 1/2 gauge needle was employed to inject the
experimental solutions in the joint. This surgically assisted intra-articular injection technique
was utilized to minimize leakage or spreading of the injectable solution beyond the articular
space (Kyrkanides S, et al. (2002). J Orofac Pain 16: 229-235). In addition, 2 mice that
received 100 µl saline injection served as controls. Forty-five days following treatment, the mice
were deeply anesthetized by pentobarbital (100 mg/kg IP) and euthanized by transcardial
perfusion of 4% paraformaldehyde in phosphate buffered saline (PBS) (Kyrkanides S, et al.
(2002). J Orofac Pain 16: 229-235; Kyrkanides S, et al. (2002). Mol Brain Res 104: 159-169).
The trigeminal ganglia and brain stem were dissected and sectioned at 20 µm using a freezing
microtome. The TMJ joints were also dissected, decalcified in an EDTA buffered solution,
embedded in paraffin and cut at 8 µm sections. All tissues were stored at -20°C until further
processed.

(3) X-Gal Histochemistry

[238] Sections of trigeminal ganglia were processed by X-gal histochemistry and
evaluated under light microscopy. Specifically, the sections were washed in 0.15M phosphate
buffered saline (PBS) pH 7.2 for 60 min, followed by overnight processing in a staining solution
containing 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (1 mg/ml), potassium ferricyanide
(3mM), potassium ferrocyanide (3mM), NP-40 (0.02%) in 0.1M PBS pH 7.2 (Invitrogen) and
MgCl2 (1.3mM). The tissue was then washed in PBS for 30 min, and briefly rinsed with dH2O.
Considerable attention was given so that only the bacterial form of β-galactosidase was
detected. The slides were cover slipped with DPX mounting medium (Fluka, Neu-Ulm,
Switzerland) and examined under a light microscope (BX51 Olympus; Tokyo, Japan). Color
microphotographic images were captured in TIFF 16-bit format using a SPOT RT Color CCD
digital camera attached onto the microscope and connected to a PC computer.

(4) Cell Counting 

[239] The mouse ganglia (1.5 mm x 2 mm x 3 mm) were sectioned sagittally on a
freezing cryotome along their long axis into 20 µm thick sections. A total of approximately 42
sections were produced from each ganglion, which were sequentially collected onto 3 glass
slides, whereby each slide contained representative ganglion sections 60 µm apart from each other.
One glass slide of each ganglion was processed by X-gal histochemistry and was employed in
cell counting: all X-gal positive (blue) cells were counted on each tissue section on the slides.
Since the tissue sections were 60 µm apart, counting all blue cells on a single slide gave a
representative number of infected cells in each ganglion while avoiding overlap between
sections and subsequently any "double counting".
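
The sampling scheme in paragraph [239] can be sketched as follows. This is only an illustration: it assumes that serial sections are dealt onto the three slides in strict rotation, which the text implies but does not state, and the function names are hypothetical.

```python
SECTION_THICKNESS_UM = 20   # section thickness given in the text
N_SLIDES = 3                # glass slides per ganglion given in the text
N_SECTIONS = 42             # approximate sections per ganglion given in the text


def allocate_sections(n_sections=N_SECTIONS, n_slides=N_SLIDES):
    """Deal serial sections onto slides in rotation (assumed collection order)."""
    slides = [[] for _ in range(n_slides)]
    for index in range(n_sections):
        slides[index % n_slides].append(index)
    return slides


def spacing_um(slide, thickness=SECTION_THICKNESS_UM):
    """Distance in micrometres between consecutive sections on one slide."""
    return [(b - a) * thickness for a, b in zip(slide, slide[1:])]


slides = allocate_sections()
for number, slide in enumerate(slides, start=1):
    print(f"slide {number}: {len(slide)} sections, spacing {set(spacing_um(slide))} um")
```

Under this assumption every slide carries 14 sections spaced 60 µm apart, so no 20 µm cell profile can appear twice on the same slide.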

(5) Immunocytochemistry 

[240] Tissue sections from trigeminal ganglia were analyzed by immunocytochemistry
employing a rabbit anti β-galactosidase polyclonal antibody (Chemicon Intl., Temecula CA).
In brief, sections were washed in PBS for 60 min followed by a 30 min blocking step in normal
goat serum (4% in PBS) and overnight incubation in the primary antibody solution containing
rabbit anti β-galactosidase polyclonal antibody (1:2,500), 0.5% Triton-X, 4% normal goat serum
(Invitrogen), 1% bovine serum albumin (Sigma; St Louis, MO) in PBS. The next morning the
tissue was washed in PBS for 60 min, followed by a 30 min blocking step, and incubated for 90
min in the secondary antibody solution containing a goat anti-rabbit polyclonal antibody
(1:2,000), Triton-X (0.5%) and normal goat serum (0.15%) in PBS. Subsequently, the tissue
was washed in PBS for 30 min and incubated in an avidin-biotin complex solution (ABC kit;
Vector Laboratories, Burlingame CA), and was then washed in 0.1 M sodium acetate buffered
solution (pH 7.4) for 30 min. The tissue was then reacted in a DAB (3,3' diaminobenzidine) -
Nickel solution in 0.1 M sodium acetate buffered solution (pH 7.4) for 5 min, followed by a 15
min wash in PBS (Kyrkanides S, et al. (2002), J Orofac Pain 16: 229-235; Kyrkanides S, et al.
(2002). Mol Brain Res 104: 159-169). The glass slides were then dehydrated through multiple
ethanol solutions, cleared through xylene and cover-slipped using DPX permanent mounting
medium. The tissue sections were then studied under a light microscope and microphotographic
images were captured as described above.

[241] Tissue sections from the temporomandibular joints were first deparaffinized by
immersion in a series of xylenes and alcohols, followed by antigen retrieval processing (95°C
heating for 15 sec in 0.1 M Tris-HCl buffer pH 8.9) and processing employing the
aforementioned immunocytochemical method.

(6) Polymerase Chain Reaction (PCR) 

[242] The DNA from the left and right trigeminal ganglia of 8 mice (4 control and 4
experimental) was extracted employing the Trizol reagent (Invitrogen) according to the
manufacturer's instructions. The concentration of the recovered DNA ranged between 17-50
ng/µl, and was analyzed for the presence of viral DNA by PCR employing the following primer
sets. Detection of FIV viral DNA (Fig. 6A): 5'-TTT TTC CAG TTC CGT TTA TCC (SEQ ID
NO:35) and TTT ATC GCC AAT CCA CAT CT-3' SEQ ID NO. 36 (Ta=58°C; 40 total cycles).
Detection of active β-galactosidase gene (Fig. 6A): 5'-CCC ATA GTA ACG CCA ATA GG
(SEQ ID NO:37) and AAA TGT GAG CGA GTA ACA ACC-3' SEQ ID NO. 38 (Ta=59.6°C; 45
total cycles). Detection of genomic DNA was performed utilizing primers designed for the
murine G3PDH housekeeping gene: ACC ACA GTC CAT GCC ATC AC SEQ ID NO. 39 and
TCC ACC ACC CTG TTG CTG TA SEQ ID NO. 40 (Ta=58°C; 30 cycles). A total of 400 ng
was used as DNA template in the PCR reactions. The PCR products were analyzed by agarose
gel (1% w/v) electrophoresis and the images were captured utilizing a KODAK Image Analysis
system (Rochester NY).
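
As a quick sanity check on the primer sets listed in paragraph [242], the sketch below computes length, GC fraction and a rule-of-thumb melting temperature for each oligonucleotide. The Wallace rule used here (2°C per A/T, 4°C per G/C) is a rough textbook approximation brought in for illustration and is not the method by which the annealing temperatures above were chosen; the dictionary labels and function names are hypothetical.

```python
# Primer sequences copied from paragraph [242] (SEQ ID NO:35-40).
# The "first"/"second" labels are illustrative only.
PRIMERS = {
    "FIV first (SEQ ID NO:35)": "TTTTTCCAGTTCCGTTTATCC",
    "FIV second (SEQ ID NO:36)": "TTTATCGCCAATCCACATCT",
    "lacZ first (SEQ ID NO:37)": "CCCATAGTAACGCCAATAGG",
    "lacZ second (SEQ ID NO:38)": "AAATGTGAGCGAGTAACAACC",
    "G3PDH first (SEQ ID NO:39)": "ACCACAGTCCATGCCATCAC",
    "G3PDH second (SEQ ID NO:40)": "TCCACCACCCTGTTGCTGTA",
}


def gc_fraction(seq):
    """Fraction of G and C bases in an oligonucleotide."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Wallace rule of thumb: 2 degrees per A/T plus 4 degrees per G/C."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, Tm ~{wallace_tm(seq)} C")
```

Annealing temperatures are normally set a few degrees below the estimated melting temperature and then tuned empirically, so the values printed here should be read only as a plausibility check.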

b) RESULTS 

(1) Intra-articular FIV injection resulted in transduction of
hard and soft tissues

[243] FIV(lacZ) injection into the TMJ articular space resulted in transfer of the
reporter gene lacZ via the lentiviral vector in cells located within the articular capsule.
Specifically, cells of the TMJ meniscus, presumably fibroblasts, expressed bacterial
β-galactosidase as assessed by immunocytochemistry employing appropriate polyclonal
antibodies. In addition, cells located in the hypertrophic zone of the condyle, primarily
comprised of cartilaginous cells, as well as perivascular cells, including endothelial cells and
possibly osteocytes, also stained positive for bacterial β-galactosidase (Fig. 5). There was a lack
of β-galactosidase in the contralateral joints as well as in the saline injected animals. These results
indicate that FIV successfully infected and stably transferred the reporter gene to cells of hard and
soft TMJ tissues.

(2) FIV injection into the TMJ resulted in transduction of
trigeminal neurons

[244] Two FIV vectors were employed in our experiment: the wild type FIV(lacZ) and
the mutated FIV(Δ'lac) (Fig. 6A). FIV(Δ'lac) is capable of transducing cells with an inactive
form of the reporter gene β-galactosidase compared to FIV(lacZ), which carries a full-length
lacZ (Fig. 6B & 6C). Injection of either FIV vector in the right TMJ of mice resulted in
transduction of neurons located in the ipsilateral trigeminal ganglia as assessed by PCR (Fig.
7A). Full length lacZ gene was detected by PCR only in the FIV(lacZ) treated animals (Fig. 7B),
accompanied by neuronal β-galactosidase expression as assessed by X-gal histochemistry. The
X-gal staining was localized primarily in the posterolateral part of the ganglion within the cell
bodies of cells that appear histologically as neurons (Fig. 8A & 8B). In fact, the cell bodies of
the primary sensory neurons that innervate the TMJ are known to localize in this part of the
trigeminal ganglion. In contrast, FIV(Δ'lac) injected mice did not display any X-gal positive
cells in the ganglia (Fig. 8C). Expression of bacterial β-galactosidase in the trigeminal ganglia
was also confirmed by immunocytochemistry in the FIV(lacZ) (Fig. 8D) but not the FIV(Δ'lac)
treated mice (Fig. 8E). Moreover, analysis of sections from the brain stem did not reveal any
X-gal positive staining, as was anticipated since the vectors are defective, do not replicate and
cannot infect second order neurons.

(3) A considerable number of trigeminal neurons were infected
by FIV(lacZ)

[245] The mouse ganglia were on average of the following dimensions: 1.5 mm x 2 mm
x 3 mm. As described above, a total of approximately 42 sections (20 µm thick) were produced
from each ganglion, which were sequentially collected onto 3 glass slides, whereby each slide
contained representative ganglion sections 60 µm apart from each other. Consistently, 4 sections
were identified containing X-gal (blue) cells on each glass slide, with an average of 93 (± 7.64
S.D.) blue cells per section. Therefore, we infer that there were approximately 93 cells x 4
sections x 3 glass slides = 1,116 transduced neurons in each right-sided ganglion in the FIV-
injected animals. No X-gal positive cells were identified in the saline injected animals. These
results suggest that from a total of 5x10^ infectious particles injected into the articular TMJ
space approximately 10^ nerve fibers were infected resulting in lacZ expression, presumably
following uptake of the lentiviral vector by peripheral nerve fibers and retrograde transport to the
nucleus.
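
The extrapolation in paragraph [245] is a simple product of three quantities. A minimal sketch follows, with hypothetical variable and function names; it assumes, as the text does, that the single counted slide is representative of the two uncounted slides.

```python
# Values reported in paragraph [245].
MEAN_BLUE_CELLS_PER_SECTION = 93   # average X-gal positive cells per positive section
POSITIVE_SECTIONS_PER_SLIDE = 4    # sections containing blue cells on the counted slide
SLIDES_PER_GANGLION = 3            # only one of the three slides was stained and counted


def estimate_transduced_neurons(cells_per_section, sections_per_slide, slides):
    """Scale the count from one representative slide to the whole ganglion."""
    return cells_per_section * sections_per_slide * slides


estimate = estimate_transduced_neurons(
    MEAN_BLUE_CELLS_PER_SECTION,
    POSITIVE_SECTIONS_PER_SLIDE,
    SLIDES_PER_GANGLION,
)
print(f"estimated transduced neurons per ganglion: {estimate:,}")
```

Because the three slides hold interleaved sections, multiplying by three accounts for the sections that were collected but not stained.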

[246] The results shown herein demonstrated that intra-articular injection of FIV(lacZ)
resulted in successful gene transfer to articular TMJ surfaces as well as the joint meniscus.
Interestingly, VSV-G does not require interaction between the viral envelope protein and a
specific membrane receptor, but instead interacts with a phospholipid component of the cell
membrane leading to membrane-fusion mediated entry. This characteristic confers broad
host-cell range for VSV-G pseudotyped viruses (Burns JC, et al. (1993). Proc Natl Acad Sci USA 90:
8033-8037; Carneiro FA, et al. (2002). J Virol 76: 3756-3764). Therefore, it is possible that FIV
vectors demonstrate higher infectivity for TMJ tissues than previously described viral vectors
(Kuboki T, et al. (1999). Arch Oral Biol 44: 701-709), as well as result in prolonged transgene
expression secondary to stable transgene integration (Poeschla EM, et al. (1998). Nature Med 4:
354-357).

[247] The efficacy of VSV-G pseudotyped FIV vectors to transduce peripheral tissues
(Kang Y, et al. (2002). J Virol 76: 9378-9388), as well as the brain (Bloemer U, et al. (1997). J
Virol 71: 6641-6649) and cerebellum (Alisky JM, et al. (2002) Mol Neurosci 11: 2669-2673)
has been previously demonstrated. The present observations of cells staining positively for
X-gal in the trigeminal ganglion ipsilateral to the site of injection indicate that FIV virions were
taken up by peripheral nerve projections of trigeminal sensory neurons, leading to infection and
expression of the reporter gene lacZ by these neurons. Therefore, VSV-G pseudotyped
lentiviruses, such as the defective feline or human immunodeficiency virus, can serve as the
platform for the transfer of anti-nociceptive genes to temporomandibular joint tissues as well as
the neurons that innervate these structures.

3. Example 3: 

a) Vector construction 

[248] The ViraPower™ Lentiviral Expression System, which can create a replication
incompetent HIV-1-based lentivirus (Invitrogen, Carlsbad CA), was employed. This system can
deliver and express NSE/Human-µ-opioid receptor in either dividing or non-dividing
mammalian cells. First the pLenti6/V5-D-TOPO vector was reconstructed by inserting a PCR
product which was generated based on the multiple cloning site of the pIRES vector (Clontech Inc.,
Palo Alto CA) with the PCR primers: MCS-upper primer 5'-
CACCTAATACGACTCACTATAGG-3' SEQ ID NO. 41 and MCS-lower primer 5'-
CATTAACCCTCACTAAAG-3' SEQ ID NO. 42. This 707 bp PCR product was purified and
cloned into the pLenti6/V5-D-TOPO vector directionally (the 4 base pair CACC sequence
anneals to the GTGG overhang sequence in the pLenti6/V5-D-TOPO vector). The multiple cloning
site was used as the template for PCR amplification to insert an NheI site at the 5' end and a multiple
enzyme digestion sequence followed by a NotI site at the 3' end of the fragment. Then the CMV
promoter of pLenti6/V5-D-TOPO was removed with ClaI and SpeI restriction enzyme
digestions, followed by isopropanol DNA purification. Both ends of the re-constructed vector
were blunted with T4 DNA polymerase (Invitrogen, Carlsbad CA) and ligated with T4 DNA
ligase (Invitrogen) according to the manufacturer's instructions.
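
The directional cloning step in paragraph [248] relies on the CACC tag at the 5' end of the upper primer base-pairing with the GTGG overhang of the vector. The sketch below checks both conditions for the two MCS primers quoted above; the helper names are hypothetical and the code only illustrates the base-pairing rule, not the cloning protocol itself.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primers copied from paragraph [248] (SEQ ID NO:41 and SEQ ID NO:42).
MCS_UPPER = "CACCTAATACGACTCACTATAGG"
MCS_LOWER = "CATTAACCCTCACTAAAG"


def complement(seq):
    """Base-by-base complement (not reversed) of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)


def has_directional_tag(primer, tag="CACC"):
    """True if the primer carries the 5' tag needed for directional insertion."""
    return primer.upper().startswith(tag)


print("upper primer tagged:", has_directional_tag(MCS_UPPER))
print("lower primer tagged:", has_directional_tag(MCS_LOWER))
print("sequence pairing with CACC:", complement("CACC"))
```

Only the upper primer carries the tag, which is what fixes the orientation of the insert; the complement of CACC is GTGG, matching the overhang named in the text.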

[249] The NSE promoter was originally from the pTR-NT3myc-NSE vector (described in
Peel AL, et al., Gene Therapy 4(1):16-24, 1997). The NSE promoter sequence can be found in
SEQ ID NO:52. The NSE fragment was cut out with BglII and HindIII restriction enzyme
digestions. The BglII site of this fragment was blunted. Later on, the NSE fragment (2050 bp) was
ligated into HindIII and blunted XhoI sites of pBluescript II KS+/- phagemid to form
pBluescript II KS-NSE.
[250] Human-µ-opioid receptor (HUMOR) DNA was from pcDNA3-Human-µ-opioid
receptor (Wang JB, et al., PNAS USA 90:10230-4). The sequence of this particular Human-µ-
opioid receptor is found in SEQ ID NO:53.

[251] This 1.6 Kb fragment was cut out of the vector with EcoRV and XbaI, and
inserted into EcoRV/XbaI sites of pBluescript II KS-NSE to form pBluescript II KS-NSE-
HUMOR. This construct was digested with KpnI and blunted, followed by digestion with
XbaI. The whole NSE-Human µ-opioid fragment was ligated into the pIRES plasmid at XbaI and
EcoRI (blunt) sites, to become the pIRES-NSE-Human µ-opioid vector. To insert the NSE-
human µ-opioid receptor genes into the constructed pLenti6/V5-D-TOPO without CMV
promoter, pIRES-NSE-human µ-opioid was digested with NheI and SalI restriction enzymes
and ligated into NheI/SalI sites of pLenti6/V5-D-TOPO overnight, and the pLenti6/V5-D-
TOPO-NSE-HUMOR vector was constructed.

[252] In order to increase the efficacy of virus packaging, a cPPT sequence was added in
front of the NSE of pLenti6/V5-D-TOPO-NSE-HUMOR. Plasmid pLP1 (SEQ ID NO:49)
was used as template to PCR amplify cPPT fragments with ClaI and NheI sites at both ends
(upper primer 5'-atatcgatatcgctagcttttaaaagaaaaggggg-3' SEQ ID NO. 43 and lower primer 5'-
taatcgatgctaagcaaaattttgaatttttgtaatttg-3' SEQ ID NO. 44). The PCR products were digested with
NheI, and the resulting fragment was ligated into the NheI site of pLenti6/V5-D-TOPO-NSE-
HUMOR at 4°C overnight to generate the pLenti6/V5-D-TOPO-cPPT-NSE-HUMOR plasmid. A
schematic description of the vector is disclosed herein.

[253] The pLenti6/V5-D-TOPO-cPPT-NSE-HUMOR (SEQ ID NO:48) was transiently
co-transfected along with the three packaging plasmids, pLP1 (SEQ ID NO:49), pLP2 (SEQ ID
NO:50), and pLP/VSVG (SEQ ID NO:51), into 293FT cells (Invitrogen) cultured in DMEM
(Invitrogen) plus 10% FBS (Gemini, Woodland CA). After 24 hours, the medium was replaced
with fresh medium supplemented with non-essential amino acids (Invitrogen). Seventy hours
post-transfection, the supernatant was collected and filtered through a 0.45 µm Acrodisc 25 mm
syringe filter (Pall Corporation, Gelman Laboratory). Aliquots of virus were frozen at -80°C
until further use. Titering of the virus was performed on NIH3T3 cells cultured in 6 well tissue
plates and assessed at 3x10^ colonies (cfu)/ml by blasticidin selection.
b) Infection of albino neuron cells with Lenti6/NSE-HUMOR virus

[254] Neuro-2a cells were plated into a 6 well plate and cultured in MEM (GIBCO/BRL)
with 10% FBS. For infection with Lenti6/NSE-HUMOR virus, 1 ml viral solution (3x10^ ip) with 6 µg
polybrene solution was added to the N2a cell culture. After overnight plating, the medium was
changed to regular MEM with 10% FBS. The cells were harvested 96 hours after infection.

c) RT Polymerase Chain Reaction (RT-PCR) 

[255] The total RNA of the N2a cells infected with Lenti6/NSE-HUMOR virus was
extracted with Trizol reagent (Invitrogen) according to the manufacturer's instructions. 5 µg total RNA
was used to synthesize first strand DNA with the SuperScript™ First-Strand Synthesis System for
RT-PCR (Invitrogen). Analysis of the presence of the HUMOR gene was done by PCR, employing
the following primer sets: 5'-GAATTACCTAATGGGAACATGG-3' (SEQ ID NO:45) and 5'-
GCAGACGATGAACACAGC-3' (SEQ ID NO:46) (Ta=56°C, total 30 cycles). The G3PDH
housekeeping gene was used as a quantity control for PCR. Detection of genomic G3PDH DNA was
performed with primers 5'-ACCACAGCAATCAC-3' (SEQ ID NO:47) and 5'-
TCCACCACCCTGTTGCTGTA-3' (SEQ ID NO:40) (Ta=58°C, 30 cycles). The PCR products
were analyzed by agarose gel (1% w/v) electrophoresis and images were captured by a KODAK
image analysis system (Rochester NY).
H. Sequences

1. SEQ ID NO:1 Homo sapiens HUMOR, protein Genbank Accession No.

2. SEQ ID NO:2 Homo sapiens HUMOR, cDNA Genbank Accession No

3. SEQ ID NO:3 Murine HUMOR, protein Genbank Accession No

4. SEQ ID NO:4 Murine HUMOR, cDNA Genbank Accession No

5. SEQ ID NO:5: human kappa opioid receptor cDNA

6. SEQ ID NO:6 human delta opioid receptor cDNA sequence

7. SEQ ID NO:7 FIV(Opioid receptor construct)

8. SEQ ID NO:8 FIV(LacZ) (a construct can be used for nerve
transduction)

9. SEQ ID NO:9 HUMOR degenerate cDNA G to A change at position 94

10. SEQ ID NO:10: HUMOR polypeptide conservative substitution of
Val32 to I32

11. SEQ ID NO:11: Neuron specific enolase promoter

12. SEQ ID NO:12 FIV backbone

13. SEQ ID NO:13: Packaging vector

14. SEQ ID NO:14 FIV-NSE-HUMOR-pA

15. SEQ ID NO:15: Mu-opioid RECEPTOR Bovine ACCESSION
NP_776833

16. SEQ ID NO:16: Bos taurus mu opioid receptor mRNA, complete cds.
ACCESSION U89677

17. SEQ ID NO:17: mu opioid receptor - mouse.

18. SEQ ID NO:18: Mus musculus mu opioid receptor cDNA, complete cds
ACCESSION m9380

19. SEQ ID NO:19: mu opioid receptor - rat ACCESSION IS6504

20. SEQ ID NO:20: Rat mu opioid receptor mRNA, complete cds.

21. SEQ ID NO:21: mu opioid receptor [Sus scrofa] porcine ACCESSION
AAB53770

22. SEQ ID NO:22: Sus scrofa porcine mu opioid receptor mRNA, complete
cds ACCESSION AF521309

23. SEQ ID NO:23: DELTA-opioid RECEPTOR ACCESSION AAA18789

24. SEQ ID NO:24: Human delta opioid receptor mRNA, complete cds
ACCESSION U07882

25. SEQ ID NO:25: delta opioid receptor [Sus scrofa] ACCESSION
AAB39694

26. SEQ ID NO:26: delta opioid receptor [Rattus norvegicus] ACCESSION
AAA19939

27. SEQ ID NO:27: delta-opioid receptor [Mus musculus] ACCESSION
AAA37522

28. SEQ ID NO:28: Mus musculus delta-opioid receptor mRNA, cpl cds
Acc No. L06322

29. SEQ ID NO:29: Homo sapiens (human) kappa opioid receptor
ACCESSION AAA63906

30. SEQ ID NO:30: Human kappa opioid receptor (hKOR) mRNA,
complete cds Acc No. U17298

31. SEQ ID NO:31: kappa opioid receptor [Mus musculus] ACCESSION
AAA39363

32. SEQ ID NO:32: Mouse kappa opioid receptor mRNA, complete cds
ACCESSION L11065

33. SEQ ID NO:33: kappa opioid receptor [Rattus norvegicus]
ACCESSION AAA41496

34. SEQ ID NO:34: Rattus norvegicus mRNA for kappa opioid receptor,
complete cds ACCESSION D16829
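
The relationship between the SEQ ID NO:9 and SEQ ID NO:10 entries in the list above (a G to A change at cDNA position 94 and a Val32 to Ile32 substitution) can be illustrated with the first 100 nucleotides of SEQ ID NO:2 as printed in the sequence listing. This is a sketch under stated assumptions: it uses the standard genetic code, treats position 94 as 1-based, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order (NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# First 100 nucleotides of SEQ ID NO:2, copied in the listing's 10-base groups.
CDNA_START = "".join(
    "atggacagca gcgctgcccc cacgaacgcc agcaattgca ctgatgcctt "
    "ggcgtactca agttgctccc cagcacccag ccccggttcc tgggtcaact".split()
).upper()


def codon_for_residue(cdna, residue):
    """Codon (three bases) that encodes the given 1-based residue number."""
    start = 3 * (residue - 1)
    return cdna[start:start + 3]


def substitute(cdna, position, new_base):
    """Return the cDNA with the base at a 1-based position replaced."""
    return cdna[:position - 1] + new_base + cdna[position:]


wild_codon = codon_for_residue(CDNA_START, 32)
mutant_codon = codon_for_residue(substitute(CDNA_START, 94, "A"), 32)
print(wild_codon, CODON_TABLE[wild_codon], "->", mutant_codon, CODON_TABLE[mutant_codon])
```

Position 94 is the first base of codon 32 (bases 94 to 96), so changing G to A turns GTC (valine) into ATC (isoleucine), a conservative substitution.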
V. CLAIMS

What is claimed is:

1. A vector for delivering an opioid receptor to a nerve cell, comprising sequence
encoding an opioid receptor and a vector backbone.

2. The vector of claim 1, wherein the opioid receptor is a µ-opioid receptor.

3. The vector of claim 2, wherein the µ-opioid receptor has a sequence with at least 80%
identity to the sequence set forth in SEQ ID NO:1.

4. The vector of claim 2, wherein the µ-opioid receptor has a sequence with at least 85%
identity to the sequence set forth in SEQ ID NO:1.

5. The vector of claim 2, wherein the µ-opioid receptor has a sequence with at least 90%
identity to the sequence set forth in SEQ ID NO:1.

6. The vector of claim 2, wherein the µ-opioid receptor has a sequence with at least 95%
identity to the sequence set forth in SEQ ID NO:1.

7. The vector of claims 1-6, wherein the composition further comprises a promoter.

8. The vector of claim 7, wherein the promoter is a nerve cell specific promoter.

9. The vector of claim 8, wherein the promoter is the neuron specific enolase promoter.
The vector of claim 7, wherein the promoter is a NSE promoter.

10. The vector of claim 7, wherein the vector backbone is a lentiviral vector backbone.

11. The vector of claim 10, wherein the lentiviral vector backbone is a feline
immunodeficiency vector (FIV).

12. The vector of claim 11, wherein the FIV has a sequence with at least 80% identity to
the sequence set forth in SEQ ID NO:7.

13. A cell comprising the vector of claims 1-12.

14. An animal comprising the cell of claim 13.

15. A cell comprising the integrated product of the vector of claims 1-12.

16. An animal comprising the cell of claim 15.

17. A method of reducing pain in a subject, comprising administering the vector of
claims 1-12, to the subject, wherein the vector transduces a nerve cell.

18. The method of claim 17, wherein administering the vector occurs at the point of
pain.

19. The method of claim 17, wherein administering the vector occurs at the distal end of
the nerve cell.

20. The method of claim 17, wherein administering the vector occurs in the peripheral
nervous system.

21. The method of claim 17, wherein administering the vector occurs at the axon or axon
terminal of the nerve cell.

22. The method of claim 17, wherein administering the vector occurs at the dendrite of
the nerve cell.

23. The method of claim 17, wherein administering the vector occurs at the trigeminal
ganglion.

24. The vector of claim 1, wherein the vector comprises the sequence set forth in SEQ
ID NO:48.

25. The vector of claim 1, wherein the vector comprises the sequence set forth in SEQ
ID NO:7.

26. A method of producing the vector of claim 1, comprising linking the opioid receptor
sequence operably to a promoter.

27. A method of producing the cell of claim 13, comprising transfecting the vector of
claim 1 into the cell.

28. An animal produced by the process of administering the vector of claim 1 to the
animal.

29. The vector of claim 7, wherein the vector backbone is an HIV vector backbone.
SEQUENCE LISTING

<110> University of Rochester
Kyrkanides, Stephanos
Tallents, Ross H.

<120> Treatment of Pain Through Expression of
Opioid Receptors

<130> 21108.0022P1

<150> 60/448,663 
<151> 2003-02-19 

<160> 54 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 400 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 



<400> 1
Met Asp Ser Ser Ala Ala Pro Thr Asn Ala Ser Asn Cys Thr Asp Ala
1               5                   10                  15
Leu Ala Tyr Ser Ser Cys Ser Pro Ala Pro Ser Pro Gly Ser Trp Val
            20                  25                  30
Asn Leu Ser His Leu Asp Gly Asn Leu Ser Asp Pro Cys Gly Pro Asn
        35                  40                  45
Arg Thr Asp Leu Gly Gly Arg Asp Ser Leu Cys Pro Pro Thr Gly Ser
    50                  55                  60
Pro Ser Met Ile Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val
65                  70                  75                  80
Cys Val Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val
                85                  90                  95
Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu
            100                 105                 110
Ala Leu Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val
        115                 120                 125
Asn Tyr Leu Met Gly Thr Trp Pro Phe Gly Thr Ile Leu Cys Lys Ile
    130                 135                 140
Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu
145                 150                 155                 160
Cys Thr Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys
                165                 170                 175
Ala Leu Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Ile Asn Val Cys
            180                 185                 190
Asn Trp Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala
        195                 200                 205
Thr Thr Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser
    210                 215                 220
His Pro Thr Trp Tyr Trp Glu Asn Leu Leu Lys Ile Cys Val Phe Ile
225                 230                 235                 240
Phe Ala Phe Ile Met Pro Val Leu Ile Ile Thr Val Cys Tyr Gly Leu
                245                 250                 255
Met Ile Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu
            260                 265                 270
Lys Asp Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val
        275                 280                 285
Ala Val Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile
    290                 295                 300
Lys Ala Leu Val Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp
305                 310                 315                 320
His Phe Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val
                325                 330                 335
Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe
            340                 345                 350
Cys Ile Pro Thr Ser Ser Asn Ile Glu Gln Gln Asn Ser Thr Arg Ile
        355                 360                 365
Arg Gln Asn Thr Arg Asp His Pro Ser Thr Ala Asn Thr Val Asp Arg
    370                 375                 380
Thr Asn His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro
385                 390                 395                 400

<210> 2 
<211> 1200 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 2 

atggacagca gcgctgcccc cacgaacgcc agcaattgca ctgatgcctt ggcgtactca 60 
agttgctccc cagcacccag ccccggttcc tgggtcaact tgtcccactt agatggcaac 120 
ctgtccgacc catgcggtcc gaaccgcacc gacctgggcg ggagagacag cctgtgccct 180 
ccgaccggca gtccctccat gatcacggcc atcacgatca tggccctcta ctccatcgtg 240 
tgcgtggtgg ggctcttcgg aaacttcctg gtcatgtatg tgattgtcag atacaccaag 300 
atgaagactg ccaccaacat ctacattttc aaccttgctc tggcagatgc cttagccacc 360 
agtaccctgc ccttccagag tgtgaattac ctaatgggaa catggccatt tggaaccatc 420 
ctttgcaaga tagtgatctc catagattac tataacatgt tcaccagcat attcaccctc 480 
tgcaccatga gtgttgatcg atacattgca gtctgccacc ctgtcaaggc cttagatttc 540 
cgtactcccc gaaatgccaa aattatcaat gtctgcaact ggatcctctc ttcagccatt 600 
ggtcttcctg taatgttcat ggctacaaca aaatacaggc aaggttccat agattgtaca 660 
ctaacattct ctcatccaac ctggtactgg gaaaacctgc tgaagatctg tgttttcatc 720 
ttcgccttca ttatgccagt gctcatcatt accgtgtgct atggactgat gatcttgcgc 780 
ctcaagagtg tccgcatgct ctctggctcc aaagaaaagg acaggaatct tcgaaggatc 840 
accaggatgg tgctggtggt ggtggctgtg ttcatcgtct gctggactcc cattcacatt 900 
tacgtcatca ttaaagcctt ggttacaatc ccagaaacta cgttccagac tgtttcttgg 960 
cacttctgca ttgctctagg ttacacaaac agctgcctca acccagtcct ttatgcattt 1020 
ctggatgaaa acttcaaacg atgcttcaga gagttctgta tcccaacctc ttccaacatt 1080 
gagcaacaaa actccactcg aattcgtcag aacactagag accacccctc cacggccaat 1140
acagtggata gaactaatca tcagctagaa aatctggaag cagaaactgc tccgttgccc 1200 



<210> 3 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct

<400> 3
Met Asp Ser Ser Ala Gly Pro Gly Asn Ile Ser Asp Cys Ser Asp Pro
1               5                   10                  15
Leu Ala Pro Ala Ser Trp Ser Pro Ala Pro Gly Ser Trp Leu Asn Leu
            20                  25                  30
Ser His Val Asp Gly Asn Gln Ser Asp Pro Cys Gly Pro Asn Arg Thr
        35                  40                  45
Gly Leu Gly Gly Ser His Ser Leu Cys Pro Gln Thr Gly Ser Pro Ser
    50                  55                  60
Met Val Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val Cys Val
65                  70                  75                  80
Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val Arg Tyr
                85                  90                  95
Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu
            100                 105                 110
Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val Asn Tyr
        115                 120                 125
Leu Met Gly Thr Trp Pro Phe Gly Asn Ile Leu Cys Lys Ile Val Ile
    130                 135                 140
Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Cys Thr
145                 150                 155                 160
Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu
                165                 170                 175
Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Val Asn Val Cys Asn Trp
            180                 185                 190
Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala Thr Thr
        195                 200                 205
Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser His Pro
    210                 215                 220
Thr Trp Tyr Trp Glu Asn Leu Leu Lys Ile Cys Val Phe Ile Phe Ala
225                 230                 235                 240
Phe Ile Met Pro Val Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Ile
                245                 250                 255
Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu Lys Asp
            260                 265                 270
Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val Ala Val
        275                 280                 285
Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile Lys Ala
    290                 295                 300
Leu Ile Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp His Phe
305                 310                 315                 320
Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val Leu Tyr
                325                 330                 335
Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe Cys Ile
            340                 345                 350
Pro Thr Ser Ser Thr Ile Glu Gln Gln Asn Ser Ala Arg Ile Arg Gln
        355                 360                 365
Asn Thr Arg Glu His Pro Ser Thr Ala Asn Thr Val Asp Arg Thr Asn
    370                 375                 380
His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro
385                 390                 395

<210> 4 
<211> 1194 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 4 

atggacagca gcgccggccc agggaacatc agcgactgct ctgacccctt agctcctgca 60 
agttggtccc cagcacctgg ctcctggctc aacttgtccc acgttgatgg caaccagtcc 120 
gacccatgcg gtcctaaccg cacggggctt ggcgggagcc acagcctgtg ccctcagacc 180 
ggcagccctt ccatggtcac agccatcacc atcatggccc tctattctat cgtgtgtgta 240 
gtgggcctct ttggaaactt cctggtcatg tatgtgattg taagatatac caaaatgaag 300 
actgccacca acatctacat tttcaacctt gctctggcag atgccttagc cactagcacg 360 
ctgccctttc agagtgttaa ctacctgatg ggaacgtggc cctttggaaa catcctctgc 420 
aagatcgtga tctcaataga ctactacaac atgttcacca gtatcttcac cctctgcacc 480 
atgagtgtag accgctacat tgccgtctgc cacccggtca aggccctgga tttccgtacc 540 
ccccgaaatg ccaaaattgt caatgtctgc aactggatcc tctcttctgc cattggtctg 600 
cccgtaatgt tcatggcaac cacaaaatac aggcaggggt ccatagattg caccctcact 660 
ttctctcatc ccacatggta ctgggagaac ctgctcaaaa tctgtgtctt catcttcgcc 720 
ttcatcatgc cggtcctcat catcactgtg tgttatggac tgatgatctt acgactcaag 780 
agtgtccgca tgctgtcggg ctccaaagaa aaggacagga acctgcgcag gatcacccgg 840 
atggtgctgg tggtcgtggc tgtatttatt gtctgctgga cccccatcca catctatgtc 900 
atcatcaaag cactgatcac gattccagaa accactttcc agactgtttc ctggcacttc 960 
tgcattgcct tgggttacac aaacagctgc ctgaacccag ttctttatgc gttcctggat 1020 
gaaaacttca aacgatgttt tagagagttc tgcatcccaa cttcctccac aatcgaacag 1080 
caaaactctg ctcgaatccg tcaaaacact agggaacacc cctccacggc taatacagtg 1140 
gatcgaacta accaccagct agaaaatctg gaagcagaaa ctgctccatt gccc 1194 

<210> 5 

<211> 2423

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 5 

acccgcgctc gtacgtgcgc ctccgccggc agctcctgac tcatcggggg ctccgggtca 60
catgcgcccg cgcggcccta taggcgcctc ctccgcccgc cgcccgggag ccgcagccgc 120
cgccgccact gccactcccg ctctctcagc gccgccgtcg ccaccgccac cgccaccgcc 180
actaccaccg tctgagtctg cagtcccgag atcccagcca tcatgtccat agagaagatc 240
tgggcccggg agatcctgga ctcccgcggg aaccccacag tggaggtgga tctctatact 300
gccaaaggtc ttttccgggc tgcagtgccc agtggagcct ctacgggcat ctatgaggcc 360
ctggagctga gggatggaga caaacagcgt tacttaggca aaggtgtcct gaaggcagtg 420
gaccacatca actccaccat cgcgccagcc ctcatcagct caggtctctc tgtggtggag 480
caagagaaac tggacaacct gatgctggag ttggatggga ctgagaacaa atccaagttt 540
ggggccaatg ccatcctggg tgtgtctctg gccgtgtgta aggcaggggc agctgagcgg 600
gaactgcccc tgtatcgcca cattgctcag ctggccggga actcagacct catcctgcct 660
gtgccggcct tcaacgtgat caatggtggc tctcatgctg gcaacaagct ggccatgcag 720
gagttcatga tcctcccagt gggagctgag agctttcggg atgccatgcg actaggtgca 780
gaggtctacc atacactcaa gggagtcatc aaggacaaat acggcaagga tgccaccaat 840
gtgggggatg aaggtggctt tgcccccaat atcctggaga acagtgaagc cttggagctg 900
gtgaaggaag ccatcgacaa ggctggctac acggaaaaga tcgttattgg catggatgtt 960
gctgcctcag agttttatcg tgatggcaaa tatgacttgg acttcaagtc tcccactgat 1020
ccttcccgat acatcactgg ggaccagctg ggggcactct accaggactt tgtcagggac 1080
tatcctgtgg tctccattga ggacccattt gaccaggatg attgggctgc ctggtccaag 1140
ttcacagcca atgtagggat ccagattgtg ggtgatgacc tgacagtgac caacccaaaa 1200
cgtattgagc gggcagtgga agaaaaggcc tgcaactgtc tgctgctcaa ggtcaaccag 1260
atcggctcgg tcactgaagc catccaagcg tgcaagctgg cccaggagaa tggctggggg 1320
gtcatggtga gtcatcgctc aggagagact gaggacacat tcattgctga cctggtggtg 1380
gggctgtgca caggccagat caagactggt gccccgtgcc gttctgaacg tctggctaaa 1440
tacaaccagc tcatgagaat tgaggaagag ctgggggatg aagctcgctt tgccggacat 1500
aacttccgta atcccagtgt gctgtgattc ctctgcttgc ctggagacgt ggaacctctg 1560
tctcatcctc ctggaacctt gctgtcctga tctgtgatag ttcaccccct gagatcccct 1620
gagccccagg gtgcccagaa cttccctgat tgacctgctc cgctgctcct tggcttacct 1680 
gacctcttgc tgtctctgct cgccctcctt tctgtgccct actcattggg gttccgcact 1740 
ttccacttct tcctttctct ttctctcttc cctcagaaac tagaaatgtg aatgaggatt 1800 
attataaaag ggggtccgtg gaagaatgat cagcatctgt gatgggagcg tcagggttgg 1860 
tgtgctgagg tgttagagag ggaccatgtg tcacttgtgc tttgctcttg tcccacgtgt 1920 
cttccacttt gcatatgagc cgtgaactgt gcatagtgct gggatggagg ggagtgttgg 1980 
gcatgtgatc acgcctggct aataaggctt tagtgtattt atttatttat ttattttatt 2040 
tgtttttcat tcatcccatt aatcatttcc ccataactca atggcctaaa actggcctga 2100 
cttgggggaa cgatgtgtct gtatttcatg tggctgtaga tcccaagatg actggggtgg 2160 
gaggtcttgc tagaatggga agggtcatag aaagggcctt gacatcagtt cctttgtgtg 2220 
tactcactga agcctgcgtt ggtccagagc ggaggctgtg tgcctggggg agttttcctc 2280 
tatacatctc tccccaaccc taggttccct gttcttcctc cagctgcacc agagcaacct 2340 
ctcactcccc atgccacgtt ccacagttgc caccacctct gtggcattga aatgagcacc 2400 
tccattaaag tctgaatcag tgc 2423 

<210> 6 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 

synthetic construct

<400> 6 

cdgaggagcc tgcgctgctc ctggctcaca gcgctccggg cgaggagagc gggcggaccg 60 
gggggctggg ccggtgcggg cggcgaggca ggcggacgag gcgcagagac agcggggcgg 120 
ccggggcgcg gcacgcggcg ggtcggggcc ggcctctgcc ttgccgctcc cctcgcgtcg 180 
gatccccgcg cccaggcagc cggtggagag ggacgcggcg gacgccggca gccatggaac 240 
cggccccctc cgccggcgcc gagctgcagc ccccgctctt cgccaacgcc tcggacgcct 300 
accctagcgc cttccccagc gctggcgcca atgcgtcggg gccgccaggc gcgcggagcg 360 
cctcgtccct cgccctggca atcgccatca ccgcgctcta ctcggccgtg tgcgccgtgg 420 
ggctgctggg caacgtgctt gtcatgttcg gcatcgtccg gtacactaag atgaagacgg 480 
ccaccaacat ctacatcttc aacctggcct tagccgatgc gctggccacc agcacgctgc 540 
ctttccagag tgccaagtac ctgatggaga cgtggccctt cggcgagctg ctctgcaagg 600 
ctgtgctctc catcgactac tacaatatgt tcaccagcat cttcacgctc accatgatga 660 
gtgttgaccg ctacatcgct gtctgccacc ctgtcaaggc cctggacttc cgcacgcctg 720 
ccaaggccaa gctgatcaac atctgtatct gggtcctggc ctcaggcgtt ggcgtgccca 780 
tcatggtcat ggctgtgacc cgtccccggg acggggcagt ggtgtgcatg ctccagttcc 840 
ccagccccag ctggtactgg gacacggtga ccaagatctg cgtgttcctc ttcgccttcg 900 
tggtgcccat cctcatcatc accgtgtgct atggcctcat gctgctgcgc ctgcgcagtg 960 
tgcgcctgct gtcgggctcc aaggagaagg accgcagcct gcggcgcatc acgcgcatgg 1020 
tgctggtggt tgtgggcgcc ttcgtggtgt gttgggcgcc catccacatc ttcgtcatcg 1080 
tctggacgct ggtggacatc gaccggcgcg acccgctggt ggtggctgcg ctgcacctgt 1140 
gcatcgcgct gggctacgcc aatagcagcc tcaaccccgt gctctacgct ttcctcgacg 1200 
agaacttcaa gcgctgcttc cgccagctct gccgcaagcc ctgcggccgc ccagacccca 1260 
gcagcttcag ccgcgcccgc gaagccacgg cccgcgagcg tgtcaccgcc tgcaccccgt 1320 
ccgatggtcc cggcggtggc gctgccgcct gaccaggcca tccggccccc agacgcccct 1380 
ccctagttgt acccggaggc cacatgagtc ccagtgggag gcgcgagcca tgatgtggag 1440 
tggggccagt agataggtcg gagggctttg ggaccgccag atggggcctc tgtttcggag 1500 
acgggaccgg gccgctagat gggcatgggg tgggcctctg gtttggggcg aggcagagga 1560 
cagatcaatg gcgcagtgcc tctggtctgg gtgcccccgt ccacggctct aggtggggcg 1620 
ggaaagccag tgactccagg agaggagcgg gacctgtggc tctacaactg agtccttaaa 1680 
cagggcatct ccaggaaggc ggggcttcaa ccttgagaca gcttcggttt ctaacttgga 1740 
gccggacttt cggagttggg gggtccgggg ccc 1773 

<210> 7 
<211> 9426 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 7 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 480
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960 
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020 
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080 
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140 
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200 
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320 
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380 
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440 
agctaagtct cctcgagctg tgcagttaag acaaggagct aaggaagatt attcatcctt 1500 
tatagacaga ttgtttgccc aaatagatca agaacaaaat acagctgaag ttaagttata 1560 
tttaaaacag tcattgagca tagctaatgc taatgcagac tgtaaaaagg caatgagcca 1620 
ccttaagcca gaaagtaccc tagaagaaaa gttgagagct tgtcaagaaa taggctcacc 1680 
aggatataaa atgcaactct tggcagaagc tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aggatcagga ccagtgtgtt ttaattgtaa aaaaccagga catctagcaa gacaatgtag 1800 
agaagtgaaa aaatgtaata aatgtggaaa acctggtcat gtagctgcca aatgttggca 1860 
aggaaataga aagaattgta caagggaaga aagggataca acaattacaa aagtgggaag 1920 
attgggtagg atggatagga aatattccac aatatttaaa gggactattg ggaggtatct 1980 
tgggaatagg attaggagtg ttattattga ttttatgttt acctacattg gttgattgta 2040
taagaaattg tatccacaag atactaggat acacagtaat tgcaatgcct gaagtagaag 2100
gagaagaaat acaaccacaa atggaattga ggagaaatgg taggcaatgt ggcatgtctg 2160 
aaaaagagga ggaatgatga agtatctcag acttatttta taagggagat actgtgctga 2220 
gttcttccct ttgaggaagg tatgtcatat gaatccattt cgaaatcaaa ttcctcctct 2280 
gctcgcccaa tccttccaac cccctatggt ggtatggctg acacagaaaa tgtctgctcc 2340 
tgtatgggac atttgcccct cttctccaaa tataagacag gatgaggcct agcttttgct 2400 
gctccaaagt tttaaaagaa cacattgcac ggcatttagg gactctaaag ggtggaggag 2460 
gaatgaggga attgcatcat gccaaggctg gtcctcatcc atcactgctt ccagggccca 2520 
gagtggcttc caggaggtat tcttacaaag gaagcccgat ctgtagctaa cactcagagc 2580 
ccattttcct gcgttaaccc ctcccgacct catatacagg agtaacatga tcagtgacct 2640 
gggggagctg gccaaactgc gggacctgcc caagctgagg gccttggtgc tgctggacaa 2700 
cccctgtgcc gatgagactg actaccgcca ggaggccctg gtgcagatgg cacacctaga 2760 
gcgcctagac aaagagtact atgaggacga ggaccgggca gaagctgagg agatccgaca 2820 
gaggctgaag gaggaacagg agcaagaact cgacccggac caagacatgg aaccgtacct 2880 
cccgccaact tagtggctcc tctagcctgc agggacagta aaggtgatgg caggaaggca 2940 
gcccccggag gtcaaaggct gggcacgcgg gaggagaggc cagagtcaga ggctgcgggt 3000 
atctcagata tgaaggaaag atgagagagg ctcaggaaga ggtaagaaaa gacacaagag 3060 
accagagaag ggagaagaat tagagaggga ggcagaggac cgctgtctct acagacatag 3120 
ctggtagaga ctgggaggaa gggatgaacc ctgagcgcat gaagggaagg aggtggctgg 3180 
tggtatatgg aggatgtagc tgggccaggg aaaagatcct gcactaaaaa tctgaagcta 3240 
aaaataacag gacacggggt ggagaggcga aaggagggca gattgaggca gagagactga 3300 
gaggcctggg gatgtgggca ttccggtagg gcacacagtt cacttgtctt ctctttttcc 3360
aggaggccar agatgctgac ctcaagaact cataataccc cagtggggac caccgcattc 3420 
atagccctgt tacaagaagt gggagatgtt cctttttgtc ccagactgga aatccattac 3480 
atcccgaggc tcaggttctg tggtggtcat ctctgtgtgg cttgttctgt gggcctacct 3540 
aaagtcctaa gcacagctct caagcagatc cgaggcgact aagatgctag taggggttgt 3600 
ctggagagaa gagccgagga ggtgggctgt gatggatcag ttcagctttc aaataaaaag 3660 
gcgtttttat attctgtgtc gagttcgtga acccctgtgg tgggcttctc catctgtctg 3720 
ggttagtacc tgccactata ctggaataag gggacgcctg cttccctcga gttggctgga 3780 
caaggttatg agcatccgtg tacttatggg gttgccagct tggtcctgga tcgcccgggc 3840 
ccttccccca cccgttcggt tccccaccac cacccgcgct cgtacgtgcg tctccgcctg 3900 
cagctcttga ctcatcgggg cccccgggtc acatgcgctc gctcggctct ataggcgccg 3960 
ccccctgccc accccccgcc cgcgctggga gccgcagccg ccgccactcc tgctctctct 4020 
gcgccgccgc cgtcaccacc gccaccgcca ccggctgagt ctgcagtcct cgaggaactg 4080 
aaaaaccaga aagttaactg gtaagtttag tctttttgtc ttttatttca ggtcccggat 4140 
ccggtggtgg tgcaaatcaa agaactgctc ctcagtggat gttgccttta cttctaggcc 4200 
tgtacggaag tgttacttct gctctaaaag ctgcggaatt gtacccgcgg ccaagctaag 4260 
cttgatatcg aattccggat gagcctctgt gaactactaa ggtgggaggg ggctatacgc 4320 
agaggagaat gtcagatgct cagctcggtc ccctccgcct gacgctcctc tctgtctcag 4380 
ccaggactgg tttctgtaag aaacagcagg agctgtggca gcggcgaaag gaagcggctg 4440 
aggcgcttgg aacccgaaaa gtctcggtgc tcctggctac ctcgcacagc ggtgcccgcc 4500 
cggccgtcag taccatggac agcagcgctg cccccacgaa cgccagcaat tgcactgatg 4560 
ccttggcgta ctcaagttgc tccccagcac ccagccccgg ttcctgggtc aacttgtccc 4620 
acttagatgg caacctgtcc gacccatgcg gtccgaaccg caccgacctg ggcgggagag 4680 
acagcctgtg ccctccgacc ggcagtccct ccatgatcac ggccatcacg atcatggccc 4740 
tctactccat cgtgtgcgtg gtggggctct tcggaaactt cctggtcatg tatgtgattg 4800 
tcagatacac caagatgaag actgccacca acatctacat tttcaacctt gctctggcag 4860 
atgccttagc caccagtacc ctgcccttcc agagtgtgaa ttacctaatg ggaacatggc 4920 
catttggaac catcctttgc aagatagtga tctccataga ttactataac atgttcacca 4980 
gcatattcac cctctgcacc atgagtgttg atcgatacat tgcagtctgc caccctgtca 5040 
aggccttaga tttccgtact ccccgaaatg ccaaaattat caatgtctgc aactggatcc 5100 
tctcttcagc cattggtctt cctgtaatgt tcatggctac aacaaaatac aggcaaggtt 5160 
ccatagattg tacactaaca ttctctcatc caacctggta ctgggaaaac ctgctgaaga 5220 
tctgtgtttt catcttcgcc ttcattatgc cagtgctcat cattaccgtg tgctatggac 5280 
tgatgatctt gcgcctcaag agtgtccgca tgctctctgg ctccaaagaa aaggacagga 5340 
atcttcgaag gatcaccagg atggtgctgg tggtggtggc tgtgttcatc gtctgctgga 5400 
ctcccattca catttacgtc atcattaaag ccttggttac aatcccagaa actacgttcc 5460 
agactgtttc ttggcacttc tgcattgctc taggttacac aaacagctgc ctcaacccag 5520 
tcctttatgc atttctggat gaaaacttca aacgatgctt cagagagttc tgtatcccaa 5580 
cctcttccaa cattgagcaa caaaactcca ctcgaattcg tcagaacatc tagagaccac 5640 
ccctccacgg ccaatacagt ggatagaact aatcatcagc tagaaaatct ggaagcagaa 5700 
actgctccgt tgccctagag ctcgctgatc agcctcgact gtgccttcta gttgccagcc 5760 
atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca ctcccactgt 5820 
cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc attctattct 5880 
ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata gcaggcatgc 5940 
tggggtaaaa aagaaaaaag ggtggactgg gatgagtatt ggaaccctga agaaatagaa 6000 
agaatgctta tggactaggg actgtttacg aacaaatgat aaaaggaaat agctgagcat 6060 
gactcatagt taaagcgcta gcagctgcct aaccgcaaaa ccacatccta tggaaagctt 6120 
gctaatgacg tataagttgt tccattgtaa gagtatataa ccagtgcttt gtgaaacttc 6180 
gaggagtctc tttgttgagg acttttgagt tctcccttga ggctcccaca gatacaataa 6240 
atatttgaga ttgaaccctg tcgagtatct gtgtaatctt ttttacctgt gaggtctcgg 6300 
aatccgggcc gagaacttcg cagcggccgc tcatgaccga ccaagcgacg cccaacctgc 6360 
catcacgaga tttcgattcc accgccgcct tctatgaaag gttgggcttc ggaatcgttt 6420 
tccgggacgc cggctggatg atcctccagc gcggggatct catgctggag ttcttcgccc 6480 
accccaactt gtttattgca gcttataatg gttacaaata aagcaatagc atcacaaatt 6540 
tcacaaataa agcatttttt tcactgcatt ctagttgtgg tttgtccaaa ctcatcaatg 6600 
tatcttatca tgtctggatc ccgtcgacct cgagagcttg gcgtaatcat ggtcatagct 6660 
gtttcctgtg tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat 6720 
aaagtgtaaa gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc 6780 
actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg 6840 
cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct 6900 
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6960 
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 7020
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 7080
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 7140
ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 7200
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcaat gctcacgctg 7260
taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 7320
cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 7380
acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 7440
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt 7500
atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 7560
atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 7620
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 7680
gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 7740
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 7800
ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 7860
tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 7920
accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 7980
atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 8040
cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 8100
tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg 8160
tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 8220
gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 8280
agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 8340
aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 8400
gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 8460
tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 8520
gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 8580
tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 8640
aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 8700
catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 8760
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtcg acggatcggg 8820
agatctcccg atcccctatg gtcgactctc agtacaatct gctctgatgc cgcatagtta 8880
agccagtatc tgctccctgc ttgtgtgttg gaggtcgctg agtagtgcgc gagcaaaatt 8940
taagctacaa caaggcaagg cttgaccgac aattgcatga agaatctgct tagggttagg 9000
cgttttgcgc tgcttcgcga tgtacgggcc agatatacgc gttgacattg attattgact 9060
agttattaat agtaatcaat tacggggtca ttagttcata gcccatatat ggagttccgc 9120
gttacataac ttacggtaaa tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg 9180
acgtcaataa tgacgtatgt tcccatagta acgccaatag ggactttcca ttgacgtcaa 9240
tgggtggact atttacggta aactgcccac ttggcagtac atcaagtgta tcatatgcca 9300
agtacgcccc ctattgacgt caatgacggt aaatggcccg cctggcatta tgcccagtac 9360
atgaccttat gggactttcc tacttggcag tacatctacg tattagtcat cgctattacc 9420
atggtg 9426



<210> 8 
<211> 12745 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 8 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 480
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440
agctaagtct cctcgagctg tgcagttaag acaaggagct aaggaagatt attcatcctt 1500
tatagacaga ttgtttgccc aaatagatca agaacaaaat acagctgaag ttaagttata 1560
tttaaaacag tcattgagca tagctaatgc taatgcagac tgtaaaaagg caatgagcca 1620
ccttaagcca gaaagtaccc tagaagaaaa gttgagagct tgtcaagaaa taggctcacc 1680
aggatataaa atgcaactct tggcagaagc tcttacaaaa gttcaagtag tgcaatcaaa 1740
aggatcagga ccagtgtgtt ttaattgtaa aaaaccagga catctagcaa gacaatgtag 1800
agaagtgaaa aaatgtaata aatgtggaaa acctggtcat gtagctgcca aatgttggca 1860
aggaaataga aagaattgta caagggaaga aagggataca acaattacaa aagtgggaag 1920
attgggtagg atggatagga aatattccac aatatttaaa gggactattg ggaggtatct 1980
tgggaatagg attaggagtg ttattattga ttttatgttt acctacattg gttgattgta 2040
taagaaattg tatccacaag atactaggat acacagtaat tgcaatgcct gaagtagaag 2100
gagaagaaat acaaccacaa atggaattga ggagaaatgg taggcaatgt ggcatgtctg 2160
aaaaagagga ggaatgatga agtatctcag acttatttta taagggagat actgtgctga 2220
gttcttccct ttgaggaagg tatgtcatat gaatccattt cgaatcaaat caaactaata 2280
aagtatgtat tgtaaggtaa aaggaaaaga caaagaagaa gaagaaagaa gaaagccttc 2340
agtacattta tattggctca tgtccaatat gaccgccatg ttgacattga ttattgacta 2400
gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg gagttccgcg 2460
ttacataact tacggtaatt ggcccgcctg ctgaccgccc aacgaccccc gcccattgac 2520
gtcaataatg acgtatgttc ccatagtaac gccaataggg actttccatt gacgtcaatg 2580
ggtggagtat ttacggtaaa ctgcccactt ggcagtacat caagtgtatc atatgccaag 2640
tccggccccc tattgacgtc aatgacggta aatggcccgc ctggcattat gcccagtaca 2700
tgaccttacg ggactttggt acttggcagt acatctacgt attagtcatc gctattacca 2760
tggtgatgcg gttttggcag tacaccaatg ggcgtggata gcggtttgac tcacggggat 2820
ttccaagtct ccaccccatt gacgtcaatg ggagtttgtt ttggcaccaa aatcaacggg 2880
actttccaaa atgtcgtaat aaccccgccc cgttgacgca aatgggcggt aggcgtgtac 2940
ggtgggaggt ctatataagc agagctcgtt tagtgaaccg tcagatcgcc tggagacgcc 3000
atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc cgcggccggg 3060
aacggtgcat tggaacgcgg attccccgtg ccaagagtga cgtaagtacc gcctatagac 3120
tctataggca cacccctttg gctcttatgc atgctatact gtttttggct tggggcctat 3180
acacccccgc tccttatgct ataggtgatg gtatagctta gcctataggt gtgggttatt 3240
gaccattatt gaccactccc ctattggtga cgatactttc cattactaat ccataacatg 3300
gctctttgcc acaactatct ctattggcta tatgccaata ctctgtcctt cagagactga 3360
cacggactct gtatttttac aggatggggt cccatttatt atttacaaat tcacatatac 3420
aacaacgccg tcccccgtgc ccgcagtttt tattaaacat agcgtgggat ctccacgcga 3480
atctcgggta cgtgttccgg acatgggctc ttctccggta gcggcggagc ttccacatcc 3540
gagccctggt cccatgcctc cagcggctca tggtcgctcg gcagctcctt gctcctaaca 3600
gtggaggcca gacttaggca cagcacaatg cccaccacca ccagtgtgcc gcacaaggcc 3660
gtggcggtag ggtatgtgtc tgaaaatgag ctcggagatt gggctcgcac cgtgacgcag 3720
atggaagact taaggcagcg gcagaagaag atgcaggcag ctgagttgtt gtattctgat 3780
aagagtcaga ggtaactccc gttgcggttc tgttaacggt ggagggcagt gtagtctgag 3840
cagtactcgt tgctgccgcg cgcgccacca gacataatag ctgacagact aacagactgt 3900
tcctttccat gggtcttttc tgcagtcacc gtcgtcgaag cttatgacca tgattacgga 3960
ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta cccaacttaa 4020
tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg cccgcaccga 4080
tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg cgctttgcct ggtttccggc 4140
accagaagcg gtgccggaaa gctggctgga gtgcgatctt cctgaggccg atactgtcgt 4200
cgtcccctca aactggcaga tgcacggtta cgatgcgccc atctacacca acgtaaccta 4260
tcccattacg gtcaatccgc cgtttgttcc cacggagaat ccgacgggtt gttactcgct 4320
cacatttaat gttgatgaaa gctggctaca ggaaggccag acgcgaatta tttttgatgg 4380
cgttaactcg gcgtttcatc tgtggtgcaa cgggcgctgg gtcggttacg gccaggacag 4440
tcgtttgccg tctgaatttg acctgagcgc atttttacgc gccggagaaa accgcctcgc 4500
ggtgatggtg ctgcgttgga gtgacggcag ttatctggaa gatcaggata tgtggcggat 4560
gagcggcatt ttccgtgacg tctcgttgct gcataaaccg actacacaaa tcagcgattt 4620
ccatgttgcc actcgcttta atgatgattt cagccgcgct gtactggagg ctgaagttca 4680
gatgtgcggc gagttgcgtg actacctacg ggtaacagtt tctttatggc agggtgaaac 4740
gcaggtcgcc agcggcaccg cgcctttcgg cggtgaaatt atcgatgagc gtggtggtta 4800
tgccgatcgc gtcacactac gtctgaacgt cgaaaacccg aaactgtgga gcgccgaaat 4860
cccgaatctc tatcgtgcgg tggttgaact gcacaccgcc gacggcacgc tgattgaagc 4920
agaagcctgc gatgtcggtt tccgcgaggt gcggattgaa aatggtctgc tgctgctgaa 4980
cggcaagccg ttgctgattc gaggcgttaa ccgtcacgag catcatcctc tgcatggtca 5040
ggtcatggat gagcagacga tggtgcagga tatcctgctg atgaagcaga acaactttaa 5100
cgccgtgcgc tgttcgcatt atccgaacca tccgctgtgg tacacgctgt gcgaccgcta 5160
cggcctgtat gtggtggatg aagccaatat tgaaacccac ggcatggtgc caatgaatcg 5220
tctgaccgat gatccgcgct ggctaccggc gatgagcgaa cgcgtaacgc gaatggtgca 5280
gcgcgatcgt aatcacccga gtgtgatcat ctggtcgctg gggaatgaat caggccacgg 5340
cgctaatcac gacgcgctgt atcgctggat caaatctgtc gatccttccc gcccggtgca 5400
gtatgaaggc ggcggagccg acaccacggc caccgatatt atttgcccga tgtacgcgcg 5460
cgtggatgaa gaccagccct tcccggctgt gccgaaatgg tccatcaaaa aatggctttc 5520
gctacctgga gagacgcgcc cgctgatcct ttgcgaatac gcccacgcga tgggtaacag 5580
tcttggcggt ttcgctaaat actggcaggc gtttcgtcag tatccccgtt tacagggcgg 5640
cttcgtctgg gactgggtgg atcagtcgct gattaaatat gatgaaaacg gcaacccgtg 5700
gtcggcttac ggcggtgatt ttggcgatac gccgaacgat cgccagttct gtatgaacgg 5760
tctggtcttt gccgaccgca cgccgcatcc agcgctgacg gaagcaaaac accagcagca 5820
gtttttccag ttccgtttat ccgggcaaac catcgaagtg accagcgaat acctgttccg 5880
tcatagcgat aacgagctcc tgcactggat ggtggcgctg gatggtaagc cgctggcaag 5940
cggtgaagtg cctctggatg tcgctccaca aggtaaacag ttgattgaac tgcctgaact 6000
accgcagccg gagagcgccg ggcaactctg gctcacagta cgcgtagtgc aaccgaacgc 6060
gaccgcatgg tcagaagccg ggcacatcag cgcctggcag cagtggcgtc tggcggaaaa 6120
cctcagtgtg acgctccccg ccgcgtccca cgccatcccg catctgacca ccagcgaaat 6180
ggatttttgc atcgagctgg gtaataagcg ttggcaattt aaccgccagt caggctttct 6240
ttcacagatg tggattggcg ataaaaaaca actgctgacg ccgctgcgcg atcagttcac 6300
ccgtgcaccg ctggataacg acattggcgt aagtgaagcg acccgcattg accctaacgc 6360
ctgggtcgaa cgctggaagg cggcgggcca ttaccaggcc gaagcagcgt tgttgcagtg 6420
cacggcagat acacttgctg atgcggtgct gattacgacc gctcacgcgt ggcagcatca 6480
ggggaaaacc ttatttatca gccggaaaac ctaccggatt gatggtagtg gtcaaatggc 6540
gattaccgtt gatgttgaag tggcgagcga tacaccgcat ccggcgcgga ttggcctgaa 6600
ctgccagctg gcgcaggtag cagagcgggt aaactggctc ggattagggc cgcaagaaaa 6660
ctatcccgac cgccttactg ccgcctgttt tgaccgctgg gatctgccat tgtcagacat 6720
gtataccccg tacgtcttcc cgagcgaaaa cggtctgcgc tgcgggacgc gcgaattgaa 6780
ttatggccca caccagtggc gcggcgactt ccagttcaac atcagccgct acagtcaaca 6840
gcaactgatg gaaaccagcc atcgccatct gctgcacgcg gaagaaggca catggctgaa 6900
tatcgacggt ttccatatgg ggattggtgg cgacgactcc tggagcccgt cagtatcggc 6960
ggaattccag ctgagcgccg gtcgctacca ttaccagttg gtctggtgtc aaaaataact 7020
cgatcgacca gagctgagat cctacaggag tccagggctg gagagaaaac ctctgaagag 7080
gatgatgaca gagttagaag atcgcttcag gaagctattt ggcacgactt ctacaacggg 7140
agacagcaca gtagattctg aagatgaacc tcctaaaaaa gaaaaaaggg tggactggga 7200
tgagtattgg aaccctgaag aaatagaaag aatgcttatg gactagggac tgtttacgaa 7260
caaatgataa aaggaaatag ctgagcatga ctcatagtta aagcgctagc agctgcctaa 7320
ccgcaaaacc acatcctatg gaaagcttgc taatgacgta taagttgttc cattgtaaga 7380
gtatataacc agtgctttgt gaaacttcga ggagtctctt tgttgaggac ttttgagttc 7440
tcccttgagg ctcccacaga tacaataaat atttgagatt gaaccctgtc gagtatctgt 7500
gtaatctttt ttacctgtga ggtctcggaa tccgggccga gaacttcgca gcggccgctc 7560
gagcatgcat ctagagggcc ctattctata gtgtcaccta aatgctagag ctcgctgatc 7620
agcctcgact gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc 7680
cttgaccctg gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc 7740
gcattgtctg agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg 7800
ggaggattgg gaagacaata gcaggcatgc tggggatgcg gtgggctcta tggcttctga 7860
ggcggaaaga accagctggg gctcgagggg ggatccccac gcgccctgta gcggcgcatt 7920
aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc 7980
gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca 8040
agctctaaat cggggcatcc ctttagggtt ccgatttagt gctttacggc acctcgaccc 8100
caaaaaactt gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt 8160
tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac 8220
aacactcaac cctatctcgg tctattcttt tgatttataa gggattttgg ggatttcggc 8280
ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt 8340
aacgtttaca atttaaatat ttgcttatac aatcttcctg tttttggggc ttttctgatt 8400
atcaaccggg gtgggtaccg agctcgaatt ctgtggaatg tgtgtcagtt agggtgtgga 8460
aagtccccag gctccccagg caggcagaag tatgcaaagc atgcatctca attagtcagc 8520
aaccaggtgt ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct 8580
caattagtca gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc 8640
cagttccgcc cattctccgc cccatggctg actaattttt tttatttatg cagaggccga 8700
ggccgcctcg gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg 8760
cttttgcaaa aagctcccgg gagcttggat atccattttc ggatctgatc aagagacagg 8820
atgaggatcg tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg 8880
ggtggagagg ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc 8940
cgtgttccgg ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg 9000
tgccctgaat gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt 9060
tccttgcgca gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg 9120
cgaagtgccg gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat 9180
catggctgat gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca 9240
ccaagcgaaa catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca 9300
ggatgatctg gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa 9360
ggcgcgcatg cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa 9420
tatcatggtg gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc 9480
ggaccgctat caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga 9540
atgggctgac cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc 9600
cttctatcgc cttcttgacg agttcttctg agcgggactc tggggttcga aatgaccgac 9660
caagcgacgc ccaacctgcc atcacgagat ttcgattcca ccgccgcctt ctatgaaagg 9720
ttgggcttcg gaatcgtttt ccgggacgcc ggctggatga tcctccagcg cggggatctc 9780
atgctggagt tcttcgccca ccccaacttg tttattgcag cttataatgg ttacaaataa 9840
agcaatagca tcacaaattt cacaaataaa gcattttttt cactgcattc tagttgtggt 9900
ttgtccaaac tcatcaatgt atcttatcat gtctggatcc cgtcgacctc gagagcttgg 9960
cgtaatcatg gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca 10020
acatacgagc cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca 10080
cattaattgc gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc 10140
attaatgaat cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt 10200
cctcgctcac tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact 10260
caaaggcggt aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag 10320
caaaaggcca gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata 10380
ggctccgccc ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc 10440
cgacaggact ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg 10500
ttccgaccct gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc 10560
tttctcaatg ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg 10620
gctgtgtgca cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc 10680
ttgagtccaa cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga 10740
ttagcagagc gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg 10800
gctacactag aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa 10860
aaagagttgg tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg 10920
tttgcaagca gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt 10980
ctacggggtc tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat 11040
tatcaaaaag gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct 11100
aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta 11160
tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa 11220
ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac 11280
gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa 11340
gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag 11400
taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg 11460
tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag 11520
ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg 11580
tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc 11640
ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat 11700
tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata 11760
ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa 11820
aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca 11880
actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc 11940
aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc 12000
tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg 12060
aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac 12120
ctgacgtcga cggatcggga gatctcccga tcccctatgg tcgactctca gtacaatctg 12180
ctctgatgcc gcatagttaa gccagtatct gctccctgct tgtgtgttgg aggtcgctga 12240
gtagtgcgcg agcaaaattt aagctacaac aaggcaaggc ttgaccgaca attgcatgaa 12300
gaatctgctt agggttaggc gttttgcgct gcttcgcgat gtacgggcca gatatacgcg 12360
ttgacattga ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag 12420
cccatatatg gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc 12480
caacgacccc cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg 12540
gactttccat tgacgtcaat gggtggacta tttacggtaa actgcccact tggcagtaca 12600
tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta aatggcccgc 12660
ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt acatctacgt 12720
attagtcatc gctattacca tggtg 12745



<210> 9 
<211> 1200 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 9 

atggacagca gcgctgcccc cacgaacgcc agcaattgca ctgatgcctt ggcgtactca 60 
agttgctccc cagcacccag ccccggttcc tggatcaact tgtcccactt agatggcaac 120 
ctgtccgacc catgcggtcc gaaccgcacc gacctgggcg ggagagacag cctgtgccct 180 
ccgaccggca gtccctccat gatcacggcc atcacgatca tggccctcta ctccatcgtg 240 
tgcgtggtgg ggctcttcgg aaacttcctg gtcatgtatg tgattgtcag atacaccaag 300 
atgaagactg ccaccaacat ctacattttc aaccttgctc tggcagatgc cttagccacc 360 
agtaccctgc ccttccagag tgtgaattac ctaatgggaa catggccatt tggaaccatc 420 
ctttgcaaga tagtgatctc catagattac tataacatgt tcaccagcat attcaccctc 480 
tgcaccatga gtgttgatcg atacattgca gtctgccacc ctgtcaaggc cttagatttc 540 
cgtactcccc gaaatgccaa aattatcaat gtctgcaact ggatcctctc ttcagccatt 600 
ggtcttcctg taatgttcat ggctacaaca aaatacaggc aaggttccat agattgtaca 660 
ctaacattct ctcatccaac ctggtactgg gaaaacctgc tgaagatctg tgttttcatc 720 
ttcgccttca ttatgccagt gctcatcatt accgtgtgct atggactgat gatcttgcgc 780 
ctcaagagtg tccgcatgct ctctggctcc aaagaaaagg acaggaatct tcgaaggatc 840 
accaggatgg tgctggtggt ggtggctgtg ttcatcgtct gctggactcc cattcacatt 900 
tacgtcatca ttaaagcctt ggttacaatc ccagaaacta cgttccagac tgtttcttgg 960 
cacttctgca ttgctctagg ttacacaaac agctgcctca acccagtcct ttatgcattt 1020 
ctggatgaaa acttcaaacg atgcttcaga gagttctgta tcccaacctc ttccaacatt 1080 
gagcaacaaa actccactcg aattcgtcag aacactagag accacccctc cacggccaat 1140 
acagtggata gaactaatca tcagctagaa aatctggaag cagaaactgc tccgttgccc 1200 
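
The 1200-nucleotide entry above is three times the 400-residue length given for SEQ ID NO: 10, which is what a coding sequence and its translation would show. The following is a minimal sketch, not part of the listing, of how a reader could check that relationship; it assumes the standard genetic code and that the sequence is read in frame from its first nucleotide. The helper names `clean_row` and `translate` are hypothetical, and only the first row of SEQ ID NO: 9 is used as input.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_row(text):
    """Keep only nucleotide letters, dropping spaces and row-end position numbers."""
    return "".join(ch for ch in text.upper() if ch in "ACGT")


def translate(dna):
    """Translate an in-frame coding sequence; a partial trailing codon is ignored."""
    dna = clean_row(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First row of SEQ ID NO: 9 exactly as printed in the listing (assumed in frame).
first_row = "atggacagca gcgctgcccc cacgaacgcc agcaattgca ctgatgcctt ggcgtactca 60"
print(translate(first_row))  # MDSSAAPTNASNCTDALAYS
```

Passing all twenty rows of SEQ ID NO: 9 to `translate` in order would give a one-letter string that can be compared, residue by residue, with the three-letter entries of SEQ ID NO: 10.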



<210> 10 
<211> 400 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 10 

Met Asp Ser Ser Ala Ala Pro Thr Asn Ala Ser Asn Cys Thr Asp Ala 
1               5                   10                  15 

Leu Ala Tyr Ser Ser Cys Ser Pro Ala Pro Ser Pro Gly Ser Trp Ile 
            20                  25                  30 

Asn Leu Ser His Leu Asp Gly Asn Leu Ser Asp Pro Cys Gly Pro Asn 
        35                  40                  45 

Arg Thr Asp Leu Gly Gly Arg Asp Ser Leu Cys Pro Pro Thr Gly Ser 
    50                  55                  60 

Pro Ser Met Ile Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val 
65                  70                  75                  80 

Cys Val Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val 
                85                  90                  95 

Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu 
            100                 105                 110 

Ala Leu Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val 
        115                 120                 125 

Asn Tyr Leu Met Gly Thr Trp Pro Phe Gly Thr Ile Leu Cys Lys Ile 
    130                 135                 140 

Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu 
145                 150                 155                 160 

Cys Thr Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys 
                165                 170                 175 

Ala Leu Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Ile Asn Val Cys 
            180                 185                 190 

Asn Trp Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala 
        195                 200                 205 

Thr Thr Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser 
    210                 215                 220 

His Pro Thr Trp Tyr Trp Glu Asn Leu Leu Lys Ile Cys Val Phe Ile 
225                 230                 235                 240 

Phe Ala Phe Ile Met Pro Val Leu Ile Ile Thr Val Cys Tyr Gly Leu 
                245                 250                 255 

Met Ile Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu 
            260                 265                 270 

Lys Asp Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val 
        275                 280                 285 

Ala Val Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile 
    290                 295                 300 

Lys Ala Leu Val Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp 
305                 310                 315                 320 

His Phe Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val 
                325                 330                 335 

Leu Tyr Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe 
            340                 345                 350 

Cys Ile Pro Thr Ser Ser Asn Ile Glu Gln Gln Asn Ser Thr Arg Ile 
        355                 360                 365 

Arg Gln Asn Thr Arg Asp His Pro Ser Thr Ala Asn Thr Val Asp Arg 
    370                 375                 380 

Thr Asn His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro 
385                 390                 395                 400 



<210> 11 
<211> 1986 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 11 

tcctcctctg ctcgcccaat ccttccaacc ccctatggtg gtatggctga cacagaaaat 60 
gtctgctcct gtatgggaca tttgcccctc ttctccaaat ataagacagg atgaggccta 120 
gcttttgctg ctccaaagtt ttaaaagaac acattgcacg gcatttaggg actctaaagg 180 
gtggaggagg aatgagggaa ttgcatcatg ccaaggctgg tcctcatcca tcactgcttc 240 
cagggcccag agtggcttcc aggaggtatt cttacaaagg aagcccgatc tgtagctaac 300 
actcagagcc cattttcctg cgttaacccc tcccgacctc atatacagga gtaacatgat 360 
cagtgacctg ggggagctgg ccaaactgcg ggacctgccc aagctgaggg ccttggtgct 420 
gctggacaac ccctgtgccg atgagactga ctaccgccag gaggccctgg tgcagatggc 480 
acacctagag cgcctagaca aagagtacta tgaggacgag gaccgggcag aagctgagga 540 
gatccgacag aggctgaagg aggaacagga gcaagaactc gacccggacc aagacatgga 600 
accgtacctc ccgccaactt agtggctcct ctagcctgca gggacagtaa aggtgatggc 660 
aggaaggcag cccccggagg tcaaaggctg ggcacgcggg aggagaggcc agagtcagag 720 
gctgcgggta tctcagatat gaaggaaaga tgagagaggc tcaggaagag gtaagaaaag 780 
acacaagaga ccagagaagg gagaagaatt agagagggag gcagaggacc gctgtctcta 840 
cagacatagc tggtagagac tgggaggaag ggatgaaccc tgagcgcatg aagggaagga 900 
ggtggctggt ggtatatgga ggatgtagct gggccaggga aaagatcctg cactaaaaat 960 
ctgaagctaa aaataacagg acacggggtg gagaggcgaa aggagggcag attgaggcag 1020 
agagactgag aggcctgggg atgtgggcat tccggtaggg cacacagttc acttgtcttc 1080 
tctttttcca ggaggccara gatgctgacc tcaagaactc ataatacccc agtggggacc 1140 
accgcattca tagccctgtt acaagaagtg ggagatgttc ctttttgtcc cagactggaa 1200 
atccattaca tcccgaggct caggttctgt ggtggtcatc tctgtgtggc ttgttctgtg 1260 
ggcctaccta aagtcctaag cacagctctc aagcagatcc gaggcgacta agatgctagt 1320 
aggggttgtc tggagagaag agccgaggag gtgggctgtg atggatcagt tcagctttca 1380 
aataaaaagg cgtttttata ttctgtgtcg agttcgtgaa cccctgtggt gggcttctcc 1440 
atctgtctgg gttagtacct gccactatac tggaataagg ggacgcctgc ttccctcgag 1500 
ttggctggac aaggttatga gcatccgtgt acttatgggg ttgccagctt ggtcctggat 1560 
cgcccgggcc cttcccccac ccgttcggtt ccccaccacc acccgcgctc gtacgtgcgt 1620 
ctccgcctgc agctcttgac tcatcggggc ccccgggtca catgcgctcg ctcggctcta 1680 
taggcgccgc cccctgccca ccccccgccc gcgctgggag ccgcagccgc cgccactcct 1740 
gctctctctg cgccgccgcc gtcaccaccg ccaccgccac cggctgagtc tgcagtcctc 1800 
gaggaactga aaaaccagaa agttaactgg taagtttagt ctttttgtct tttatttcag 1860 
gtcccggatc cggtggtggt gcaaatcaaa gaactgctcc tcagtggatg ttgcctttac 1920 
ttctaggcct gtacggaagt gttacttctg ctctaaaagc tgcggaattg tacccgcggc 1980 
caagct 1986 

<210> 12 
<211> 5982 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 12 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 480 
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960 
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020 
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080 
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140 
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200 
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260 
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320 
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380 
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440 
agctaagtct cctcgagctg tgcagttaag acaaggagct aaggaagatt attcatcctt 1500 
tatagacaga ttgtttgccc aaatagatca agaacaaaat acagctgaag ttaagttata 1560 
tttaaaacag tcattgagca tagctaatgc taatgcagac tgtaaaaagg caatgagcca 1620 
ccttaagcca gaaagtaccc tagaagaaaa gttgagagct tgtcaagaaa taggctcacc 1680 
aggatataaa atgcaactct tggcagaagc tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aggatcagga ccagtgtgtt ttaattgtaa aaaaccagga catctagcaa gacaatgtag 1800 
agaagtgaaa aaatgtaata aatgtggaaa acctggtcat gtagctgcca aatgttggca 1860 
aggaaataga aagaattgta caagggaaga aagggataca acaattacaa aagtgggaag 1920 
attgggtagg atggatagga aatattccac aatatttaaa gggactattg ggaggtatct 1980 
tgggaatagg attaggagtg ttattattga ttttatgttt acctacattg gttgattgta 2040 
taagaaattg tatccacaag atactaggat acacagtaat tgcaatgcct gaagtagaag 2100 
gagaagaaat acaaccacaa atggaattga ggagaaatgg taggcaatgt ggcatgtctg 2160 
aaaaagagga ggaatgatga agtatctcag acttatttta taagggagat actgtgctga 2220 
gttcttccct ttgaggaagg tatgtcatat gaatccattt cgaaatcaaa ttagagctcg 2280 
ctgatcagcc tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt 2340 
gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat 2400 
tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag 2460 
caagggggag gattgggaag acaatagcag gcatgctggg gtaaaaaaga aaaaagggtg 2520 
gactgggatg agtattggaa ccctgaagaa atagaaagaa tgcttatgga ctagggactg 2580 
tttacgaaca aatgataaaa ggaaatagct gagcatgact catagttaaa gcgctagcag 2640 
ctgcctaacc gcaaaaccac atcctatgga aagcttgcta atgacgtata agttgttcca 2700 
ttgtaagagt atataaccag tgctttgtga aacttcgagg agtctctttg ttgaggactt 2760 
ttgagttctc ccttgaggct cccacagata caataaatat ttgagattga accctgtcga 2820 
gtatctgtgt aatctttttt acctgtgagg tctcggaatc cgggccgaga acttcgcagc 2880 
ggccgctcat gaccgaccaa gcgacgccca acctgccatc acgagatttc gattccaccg 2940 
ccgccttcta tgaaaggttg ggcttcggaa tcgttttccg ggacgccggc tggatgatcc 3000 
tccagcgcgg ggatctcatg ctggagttct tcgcccaccc caacttgttt attgcagctt 3060 
ataatggtta caaataaagc aatagcatca caaatttcac aaataaagca tttttttcac 3120 
tgcattctag ttgtggtttg tccaaactca tcaatgtatc ttatcatgtc tggatcccgt 3180 
cgacctcgag agcttggcgt aatcatggtc atagctgttt cctgtgtgaa attgttatcc 3240 
gctcacaatt ccacacaaca tacgagccgg aagcataaag tgtaaagcct ggggtgccta 3300 
atgagtgagc taactcacat taattgcgtt gcgctcactg cccgctttcc agtcgggaaa 3360 
cctgtcgtgc cagctgcatt aatgaatcgg ccaacgcgcg gggagaggcg gtttgcgtat 3420 
tgggcgctct tccgcttcct cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg 3480 
agcggtatca gctcactcaa aggcggtaat acggttatcc acagaatcag gggataacgc 3540 
aggaaagaac atgtgagcaa aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt 3600 
gctggcgttt ttccataggc tccgcccccc tgacgagcat cacaaaaatc gacgctcaag 3660 
tcagaggtgg cgaaacccga caggactata aagataccag gcgtttcccc ctggaagctc 3720 
cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga tacctgtccg cctttctccc 3780 
ttcgggaagc gtggcgcttt ctcaatgctc acgctgtagg tatctcagtt cggtgtaggt 3840 
cgttcgctcc aagctgggct gtgtgcacga accccccgtt cagcccgacc gctgcgcctt 3900 
atccggtaac tatcgtcttg agtccaaccc ggtaagacac gacttatcgc cactggcagc 3960 
agccactggt aacaggatta gcagagcgag gtatgtaggc ggtgctacag agttcttgaa 4020 
gtggtggcct aactacggct acactagaag gacagtattt ggtatctgcg ctctgctgaa 4080 
gccagttacc ttcggaaaaa gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg 4140 
tagcggtggt ttttttgttt gcaagcagca gattacgcgc agaaaaaaag gatctcaaga 4200 
agatcctttg atcttttcta cggggtctga cgctcagtgg aacgaaaact cacgttaagg 4260 
gattttggtc atgagattat caaaaaggat cttcacctag atccttttaa attaaaaatg 4320 
aagttttaaa tcaatctaaa gtatatatga gtaaacttgg tctgacagtt accaatgctt 4380 
aatcagtgag gcacctatct cagcgatctg tctatttcgt tcatccatag ttgcctgact 4440 
ccccgtcgtg tagataacta cgatacggga gggcttacca tctggcccca gtgctgcaat 4500 
gataccgcga gacccacgct caccggctcc agatttatca gcaataaacc agccagccgg 4560 
aagggccgag cgcagaagtg gtcctgcaac tttatccgcc tccatccagt ctattaattg 4620 
ttgccgggaa gctagagtaa gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat 4680 
tgctacaggc atcgtggtgt cacgctcgtc gtttggtatg gcttcattca gctccggttc 4740 
ccaacgatca aggcgagtta catgatcccc catgttgtgc aaaaaagcgg ttagctcctt 4800 
cggtcctccg atcgttgtca gaagtaagtt ggccgcagtg ttatcactca tggttatggc 4860 
agcactgcat aattctctta ctgtcatgcc atccgtaaga tgcttttctg tgactggtga 4920 
gtactcaacc aagtcattct gagaatagtg tatgcggcga ccgagttgct cttgcccggc 4980 
gtcaatacgg gataataccg cgccacatag cagaacttta aaagtgctca tcattggaaa 5040 
acgttcttcg gggcgaaaac tctcaaggat cttaccgctg ttgagatcca gttcgatgta 5100 
acccactcgt gcacccaact gatcttcagc atcttttact ttcaccagcg tttctgggtg 5160 
agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata agggcgacac ggaaatgttg 5220 
aatactcata ctcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat 5280 
gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggttc cgcgcacatt 5340 
tccccgaaaa gtgccacctg acgtcgacgg atcgggagat ctcccgatcc cctatggtcg 5400 
actctcagta caatctgctc tgatgccgca tagttaagcc agtatctgct ccctgcttgt 5460 
gtgttggagg tcgctgagta gtgcgcgagc aaaatttaag ctacaacaag gcaaggcttg 5520 
accgacaatt gcatgaagaa tctgcttagg gttaggcgtt ttgcgctgct tcgcgatgta 5580 
cgggccagat atacgcgttg acattgatta ttgactagtt attaatagta atcaattacg 5640 
gggtcattag ttcatagccc atatatggag ttccgcgtta cataacttac ggtaaatggc 5700 
ccgcctggct gaccgcccaa cgacccccgc ccattgacgt caataatgac gtatgttccc 5760 
atagtaacgc caatagggac tttccattga cgtcaatggg tggactattt acggtaaact 5820 
gcccacttgg cagtacatca agtgtatcat atgccaagta cgccccctat tgacgtcaat 5880 
gacggtaaat ggcccgcctg gcattatgcc cagtacatga ccttatggga ctttcctact 5940 
tggcagtaca tctacgtatt agtcatcgct attaccatgg tg 5982 



<210> 13 

<211> 13361 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 13 

tggcagtaca tcaagtgtat catatgccaa gtacgccccc tattgacgtc aatgacggta 60 
aatggcccgc ctggcattat gcccagtaca tgaccttatg ggactttcct acttggcagt 120 
acatctacgt attagtcatc gctattacca tggtgatgcg gttttggcag tacatcaatg 180 
ggcgtggata gcggtttgac tcacggggat ttccaagtct ccaccccatt gacgtcaatg 240 
ggagtttgtt ttggcaccaa aatcaacggg actttccaaa atgtcgtaac aactccgccc 300 
cattgacgca aatgggcggt aggcgtgtac ggtgggaggt ctatataagc agagctctct 360 
ggctaactag agaacccact gcttaactgg cttatcgaaa ttaatacgac tcactatagg 420 
gagacccaag cttggtaccg agctcggatc cactagtaac ggccgccagt gtgctggaat 480 
tctgcagata tccatcacac tggcggccca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacggtgtct actgctgctg cagctgaaaa tatgtattct 960 
caaatgggat tagacactag gccatctatg aaagaagcag gtggaaaaga ggaaggccct 1020 
ccacaggcat atcctattca aacagtaaat ggagtaccac aatatgtagc acttgaccca 1080 
aaaatggtgt ccatttttat ggaaaaggca agagaaggac taggaggtga ggaagttcaa 1140 
ctatggttta ctgccttctc tgcaaattta acacctactg acatggccac attaataatg 1200 
gccgcaccag ggtgcgctgc agataaagaa atattggatg aaagcttaaa gcaactgaca 1260 
gcagaatatg atcgcacaca tccccctgat gctcccagac cattacccta ttttactgca 1320 
gcagaaatta tgggtatagg attaactcaa gaacaacaag cagaagcaag atttgcacca 1380 
gctaggatgc agtgtagagc atggtatctc gaggcattag gaaaattggc tgccataaaa 1440 
gctaagtctc ctcgagctgt gcagttaaga caaggagcta aggaagatta ttcatccttt 1500 
atagacagat tgtttgccca aatagatcaa gaacaaaata cagctgaagt taagttatat 1560 
ttaaaacagt cattgagcat agctaatgct aatgcagact gtaaaaaggc aatgagccac 1620 
cttaagccag aaagtaccct agaagaaaag ttgagagctt gtcaagaaat aggctcacca 1680 
ggatataaaa tgcaactctt ggcagaagct cttacaaaag ttcaagtagt gcaatcaaaa 1740 
ggatcaggac cagtgtgttt taattgtaaa aaaccaggac atctagcaag acaatgtaga 1800 
gaagtgaaaa aatgtaataa atgtggaaaa cctggtcatg tagctgccaa atgttggcaa 1860 
ggaaatagaa agaattcggg aaactggaag gcggggcgag ctgcagcccc agtgaatcaa 1920 
atgcagcaag cagtaatgcc atctgcacct ccaatggagg agaaactatt ggatttataa 1980 
attataataa agtaggtact actacaacat tagaaaagag gccagaaata ctcatatttg 2040 
taaatggata tcctataaaa tttttattag acacaggagc agatataaca attttaaata 2100 
ggagagattt tcsiagtaaaa aattctatag aaaatggaag gcaaaatatg attggagtag 2160 
gaggaggaaa gagaggaaca aattatatta atgtacattt agagattaga gatgaaaatt 2220 
ataagacaca atgtatattt ggtaatgttt gtgtcttaga agataactca ttaatacaac 2280 
cattattagg gagagataat atgattaaat tcaatattag gttagtaatg gctcaaattt 2340 
ctgataagat tccagtagta aaagtaaaaa tgaaggatcc taataaagga cctcaaataa 2400 
aacaatggcc attaacaaat gaaaaaattg aagccttaac agaaatagta gaaagactag 2460 
aaagagaagg gaaagtaaaa agagcagatc caaataatcc atggaataca ccagtatttg 2520 
ctataaaaEia gaaaagtgga aaatggagaa tgctcataga ttttagagaa ttaaacaaac 2580 
taactgagaa aggagcagag gtccagttgg gactacctca tcctgctggt ttacaaataa 2640 
aaaaacaagt aacagtatta gatatagggg atgcatattt caccattcct cttgatccag 2700 
attatgctcc ttatacagca tttactttac ctagaaaaaa taatgcggga ccaggaagga 2760 
gatttgtgtg gtgtagtcta ccacaaggct ggattttaag tccattgata tatcaaagta 2820 
cattagataa tataatacaa ccttttatta gacaaaatcc tcaattagat atttaccaat 2880 
atatggatga catttatata ggatcaaatt taagtaaaaa ggagcataaa gaaaaggtag 2940 
aagaattaag aaaattacta ttatggtggg gatttgaaac tccagaagat aaattacagg 3000 
aagaaccccc atatacatgg atgggttatg aattacatcc attaacatgg acaatacaac 3060 
agaaacagtt agacattcca gaacagccca ctctaaatga gttgcaaaaa ttagcaggaa 3120 
aaattaattg ggctagccaa gctattccag acttgagtat aaaagcatta actaacatga 3180 
tgagaggaaa tcaaaaccta aattcaacaa gacaatggac taaagaagct cgactggaag 3240 
tacaaaaggc aaaaaaggct atagaagaac aagtacaact aggatactat gaccccagta 3300 
aggagttata tgctaaatta agtttggtgg gaccacatca aataagttat caagtatatc 3360 
agaaggatcc agaaaagata ctatggtatg gaaaaatgag tagacaaaag aaaaaggcag 3420 
aaaatacatg tgatatagcc ttaagagcat gctataagat aagagaagag tctattataa 3480 
gaataggaaa agaaccaaga tatgaaatac ctacttctag agaagcctgg gaatcaaatt 3540 
taattaattc accatatctt aaggccccac ctcctgaggt agaatatatc catgctgctt 3600 
tgaatataaa gagagcgtta agtatgataa aagatgctcc aataccagga gcagaaacat 3660 
ggtatataga tggaggtaga aagctaggaa aagcagcaaa agcagcctat tggacagata 3720 
caggaaagtg gcaagtgatg gaattagaag gcagtaatca gaaggcagaa atacaagcat 3780 
tattattggc attaaaagca ggatcagagg agatgaatat tataacagat tcacaatatg 3840 
ttataaatat tattcttcaa caaccagata tgatggaggg aatctggcaa gaagttttag 3900 
aagaattgga gaagaaaaca gcaatattta tagattgggt cccaggacat aaaggtattc 3960 
caggaaatga ggaagtagat aagctttgtc aaacaatgat gataatagaa ggggatggga 4020 
tattagataa aaggtcagaa gatgcaggat atgatttatt agctgcaaaa gaaatacatt 4080 
tattgccagg agaggtaaaa gtaataccaa caggggtaaa gctaatgttg cctaaaggat 4140 
attggggatt aataatagga aaaagctcga tagggagtaa aggattggat gtattaggag 4200 
gggtaataga cgaaggatat cgaggtgaaa ttggagtaat aatgattaat gtatcaagaa 4260 
aatcaatcac cttaatggaa cgacaaaaga tagcacaatt aataatattg ccttgtaaac 4320 
atgaagtatt agaacaagga aaagtagtaa tggattcaga gagaggagac aatggttatg 4380 
ggtcaacagg agtattctcc tcttgggttg acagaattga ggaagcagaa ataaatcatg 4440 
aaaaatttca ctcagatcca cagtacttaa ggactgaatt taatttacct aaaatggtag 4500 
cagaagagat aagacgaaaa tgcccagtat gcagaatcag aggagaacaa gtgggaggac 4560 
aattgaaaat agggcctggt atctggcaaa tggattgcac acactttgat ggcaaaataa 4620 
ttcttgtggg tatacatgtg gaatcaggat atatatgggc acaaataatt tctcaagaaa 4680 
ctgctgactg tacagttaaa gctgtcttac aattgttgag tgctcataat gttactgaat 4740 
tacaaacaga taatggacca aattttaaaa atcaaaagat ggaaggagta ctcaattaca 4800 
tgggtgtgaa acataagttt ggtatcccag ggaacccaca gtcacaagca ttagttgaaa 4860 
atgtaaatca tacattaaaa gtttggattc ggaaattttt gcctgaaaca acctccttgg 4920 
ataatgcctt atctctcgct gtacatagtc tcaattttaa aagaagaggt aggataggag 4980 
ggatggcccc ttatgaatta ttagcacaac aagaatcctt aagaatacaa gattattttt 5040 
ctgcaatacc acaaaaattg caagcacagt ggatttatta taaagatcaa aaagataaga 5100 
aatggaaagg accaatgaga gtagaatact Sgggacaggg atcagtatta ttaaaggatg 5160 
aagagaaggg atattttctt atacctagga gacacataag gagagttcca gaaccctgcg 5220 
ctcttcctga aggggatgag tgaagaagat tggcaggtaa gtagaagact ctttgcagtg 5280 
ctccaaggag gagtaaatag cgctatgcta tacatatcta ggfctacctcc ggatgaaaga 5340 
gaaaagtata aaaaagactt caagaaaaga ctttttgaca cagaaacagg atttataaag 5400 
agactacgga aagctgaagg aataaaatgg agctttcata ctagagatta ttacatagga 5460 
tatgtcagag aaatggtggc aggatccact acatcattaa gtctaaggat gtatatatat 5520 
ataagtaacc cactatggca ttctcagtat cgtccaggtt tgaaaaattt caataaggaa 5580 
tggccttttg taaatatgtg gataaaaaca ggatttatgt gggatgatat tgaaaaacaa 5640 
aatatttgta taggaggaga agtttcacca ggatggggac cagggatggt aggtatagca 5700 
ataaaagctt ttagttgtgg cgaaagaaag attgaggcta ctcctgtaat gattataaga 5760 
ggagaaatag atccaaaaaa atggtgcgga gattgttgga atttaatgtg tcttagaaac 5820 
tcacctccaa agactttaca aagactcgct atgttggcgt gtggcgtgcc ggctaagaag 5880 
tggcgaggat gctgtaatca acgctttgtt tctccttaca gaacgcctgc tgatttagag 5940 
gtcattcaat ccaagcccag ctggaacctg ttatggtcgg gagaattatg aatggaagac 6000 
ataatagtat tattcaatag ggtcactgag aaactagaaa aagaattagc tatcagaata 6060 
tttgtattag cacatcaatt agaaagggac aaagctatta gattactaca aggattattt 6120 
tggagatata gatttaagaa accccgagta gattattgtt tatgttggtg gtgttgcaaa 6180 
ttctattatt ggcagttgca atctacatta tcaataacta ctgcttagaa atatttagat 6240 
taatatttca tttgcaacaa taagaatggc agaaggattt gcagccaata gacaatggat 6300 
aggactagaa gaagctgaag agttattaga ttttgatata gcaacacaaa tgagtgaaga 6360 
aggaccacta aatccaggag taaacccatt tagggtacct ggaataacag aaaaagaaaa 6420 
gcaaaactac tgtaacatat tacaacctaa gttacaagat ctaaggaacg aaattcaaga 6480 
ggtaaaactg gaagaaggaa atgcaggtaa gtttagaaga gcaagatttt taaggtattc 6540 
tgatgaaagt gtattgtccc tggttcatgc gttcatagga tattgtatat atttaggtaa 6600 
tcgaaataag ttaggatctt taagacatga cattgatata gaagcacccc aagaagagtg 6660 
ttataataat agagagaagg gtacaactga caatataaaa tatggtagac gatgttgcct 6720 
aggaacggtg actttgtacc tgattttatt tataggaata ataatatatt cacagacaac 6780 
caacgctcag gtagtatgga gacttccacc attagtagtc ccagtagaag aatcagaaat 6840 
aattttttgg gattgttggg caccagaaga acccgcctgt caggactttc ttggggcaat 6900 
gatacatcta aaagctaaga caaatataag tatacgagag ggacctacct tggggaattg 6960 
ggctagagaa atatgggcaa cattattcaa aaaggctact agacaatgta gaagaggcag 7020 
aatatggaaa agatggaatg agactataac aggaccabca ggatgtgcta ataacacatg 7080 
ttataatgtt tcagtaatag tacctgatta tcagtgttat ttagatagag tagatacttg 7140 
gttacaaggg aaaataaata tatcattatg tctaacagga ggaaaaatgt tgtacaataa 7200 
agttacaaaa caattaagct attgtacaga cccattacaa atcccactga tcaattatac 7260 
atttggacct aatcaaacat gtatgtggaa tacttcacaa attcaggacc ctgaaatacc 7320 
aaggccgcgg ccctggcaac ccatcaagaa gctgtagaaa aggtgactga agccttaaag 7380 
ataaacaact taagattagt tacattagag catcaagtac tagtaatagg attaaaagta 7440 
gaagctatgg aaaaattttt gtatacagct ttcgctatgc aagaattagg atgtaatcaa 7500 
aatcaatttt tctgcaaaat ccctcctgag ttgtggacaa ggtataatat gactataaat 7560 
caaacaatat ggaatcatgg aaatataact ttgggggaat ggtataacca aacaaaagat 7620 
ttacaacaaa agttttatga aataataatg gacatagaac aaaataatgt acaagggaag 7680 
aaagggatac aacaattaca aaagtgggaa gattgggtag gatggatagg aaatattcca 7740 
caatatttaa agggactatt gggaggtatc ttgggaatag gattaggagt gttattattg 7800 
attttatgtt tacctacatt ggttgattgt ataagaaatt gtatccacaa gatactagga 7860 
tacacagtaa ttgcaatgcc tgaagtagaa ggagaagaaa tacaaccaca aatggaattg 7920 
aggagaaatg gtaggcaatg tggcatgtct gaaaaagagg aggaatgatg aagtatctca 7980 
gacttatttt ataagggaga tactgtgctg agttcttccc tttgaggaag gtatgtcata 8040 
tgaatccatt tcgaatcaaa tcaaactaat aaagtatgta ttgtaaggta aaaggaaaag 8100 
acaaagaaga agaagaaaga agaaagcctt caagaggatg atgacagagt tagaagatcg 8160 
cttcaggaag ctatttggca cgacttctac aacgggagac agcacagtag attctgaaga 8220 
tgaacctcct aaaaaagaaa aaagggtgga ctgggatgag tattggaacc ctgaagaaat 8280 



18/53 

WO 2004/073646 

PCT/US2004/004914 



agaaagaatg cttatggact agggactgtt tacgaacaaa tgataaaagg aaatagctga 8340 
ctagagggcc ctattctata gtgtcaccta aatgctagag ctcgctgatc agcctcgact 8400 
gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg 8460 
gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg 8520 
agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg 8580 
gaagacaata gcaggcatgc tggggatgcg gtgggctcta tggcttctga ggcggaaaga 8640 
accagctggg gctcgagggg ggatccccac gcgccctgta gcggcgcatt aagcgcggcg 8700 
ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct 8760 
ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat 8820 
cggggcatcc ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt 8880 
gattagggtg atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg 8940 
acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac 9000 
cctatctcgg tctattcttt tgatttataa gggattttgg ggatttcggc ctattggtta 9060 
aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgtttaca 9120 
atttaaatat ttgcttatac aatcttcctg tttttggggc ttttctgatt atcaaccggg 9180 
gtgggtaccg agctcgaatt ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag 9240 
gctccccagg caggcagaag tatgcaaagc atgcatctca attagtcagc aaccaggtgt 9300 
ggaaagtccc caggctcccc agcaggcaga agtatgcaaa gcatgcatct caattagtca 9360 
gcaaccatag tcccgcccct aactccgccc atcccgcccc taactccgcc cagttccgcc 9420 
cattctccgc cccatggctg actaattttt tttatttatg cagaggccga ggccgcctcg 9480 
gcctctgagc tattccagaa gtagtgagga ggcttttttg gaggcctagg cttttgcaaa 9540 
aagctcfccgg gagcttggat atccattttc ggatctgatc aagagacagg atgaggatcg 9600 
tttcgcatga ttgaacaaga tggattgcac gcaggttctc cggccgcttg ggtggagagg 9660 
ctattcggct atgactgggc acaacagaca atcggctgct ctgatgccgc cgtgttccgg 9720 
ctgtcagcgc aggggcgccc ggttcttttt gtcaagaccg acctgtccgg tgccctgaat 9780 
gaactgcagg acgaggcagc gcggctatcg tggctggcca cgacgggcgt tccttgcgca 9840 
gctgtgctcg acgttgtcac tgaagcggga agggactggc tgctattggg cgaagtgccg 9900 
gggcaggatc tcctgtcatc tcaccttgct cctgccgaga aagtatccat catggctgat 9960 
gcaatgcggc ggctgcatac gcttgatccg gctacctgcc cattcgacca ccaagcgaaa 10020 
catcgcatcg agcgagcacg tactcggatg gaagccggtc ttgtcgatca ggatgatctg 10080 
gacgaagagc atcaggggct cgcgccagcc gaactgttcg ccaggctcaa ggcgcgcatg 10140 
cccgacggcg aggatctcgt cgtgacccat ggcgatgcct gcttgccgaa tatcatggtg 10200 
gaaaatggcc gcttttctgg attcatcgac tgtggccggc tgggtgtggc ggaccgctat 10260 
caggacatag cgttggctac ccgtgatatt gctgaagagc ttggcggcga atgggctgac 10320 
cgcttcctcg tgctttacgg tatcgccgct cccgattcgc agcgcatcgc cttctatcgc 10380 
cttcttgacg agttcttctg agcgggactc tggggttcga aatgaccgac caagcgacgc 10440 
ccaacctgcc atcacgagat ttcgattcca ccgccgcctt ctatgaaagg ttgggcttcg 10500 
gaatcgtttt ccgggacgcc ggctggatga tcctccagcg cggggatctc atgctggagt 10560 
tcttcgccca ccccaacttg tttattgcag cttataatgg ttacaaataa agcaatagca 10620 
tcacaaattt cacaaataaa gcattttttt cactgcattc tagttgtggt ttgtccaaac 10680 
tcatcaatgt atcttatcat gtctggatcc cgtcgacctc gagagcttgg cgtaatcatg 10740 
gtcatagctg tttcctgtgt gaaattgtta tccgctcaca attccacaca acatacgagc 10800 
cggaagcata aagtgtaaag cctggggtgc ctaatgagtg agctaactca cattaattgc 10860 
gttgcgctca ctgcccgctt tccagtcggg aaacctgtcg tgccagctgc attaatgaat 10920 
cggccaacgc gcggggagag gcggtttgcg tattgggcgc tcttccgctt cctcgctcac 10980 
tgactcgctg cgctcggtcg ttcggctgcg gcgagcggta tcagctcact caaaggcggt 11040 
aatacggtta tccacagaat caggggataa cgcaggaaag aacatgtgag caaaaggcca 11100 
gcaaaaggcc aggaaccgta aaaaggccgc gttgctggcg tttttccata ggctccgccc 11160 
ccctgacgag catcacaaaa atcgacgctc aagtcagagg tggcgaaacc cgacaggact 11220 
ataaagatac caggcgtttc cccctggaag ctccctcgtg cgctctcctg ttccgaccct 11280 
gccgcttacc ggatacctgt ccgcctttct cccttcggga agcgtggcgc tttctcaatg 11340 
ctcacgctgt aggtatctca gttcggtgta ggtcgttcgc tccaagctgg gctgtgtgca 11400 
cgaacccccc gttcagcccg accgctgcgc cttatccggt aactatcgtc ttgagtccaa 11460 
cccggtaaga cacgacttat cgccactggc agcagccact ggtaacagga ttagcagagc 11520 
gaggtatgta ggcggtgcta cagagttctt gaagtggtgg cctaactacg gctacactag 11580 
aaggacagta tttggtatct gcgctctgct gaagccagtt accttcggaa aaagagttgg 11640 
tagctcttga tccggcaaac aaaccaccgc tggtagcggt ggtttttttg tttgcaagca 11700 
gcagattacg cgcagaaaaa aaggatctca agaagatcct ttgatctttt ctacggggtc 11760 
tgacgctcag tggaacgaaa actcacgtta agggattttg gtcatgagat tatcaaaaag 11820 
gatcttcacc tagatccttt taaattaaaa atgaagtttt aaatcaatct aaagtatata 11880 
tgagtaaact tggtctgaca gttaccaatg cttaatcagt gaggcaccta tctcagcgat 11940 
ctgtctattt cgttcatcca tagttgcctg actccccgtc gtgtagataa ctacgatacg 12000 
ggagggctta ccatctggcc ccagtgctgc aatgataccg cgagacccac gctcaccggc 12060 
tccagattta tcagcaataa accagccagc cggaagggcc gagcgcagaa gtggtcctgc 12120 
aactttatcc gcctccatcc agtctattaa ttgttgccgg gaagctagag taagtagttc 12180 
gccagttaat agtttgcgca acgttgttgc cattgctaca ggcatcgtgg tgtcacgctc 12240 
gtcgtttggt atggcttcat tcagctccgg ttcccaacga tcaaggcgag ttacatgatc 12300 
ccccatgttg tgcaaaaaag cggttagctc cttcggtcct ccgatcgttg tcagaagtaa 12360 
gttggccgca gtgttatcac tcatggttat ggcagcactg cataattctc ttactgtcat 12420 
gccatccgta agatgctttt ctgtgactgg tgagtactca accaagtcat tctgagaata 12480 
gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaata cgggataata ccgcgccaca 12540 
tagcagaact ttaaaagtgc tcatcattgg aaaacgttct tcggggcgaa aactctcaag 12600 
gatcttaccg ctgttgagat ccagttcgat gtaacccact cgtgcaccca actgatcttc 12660 
agcatctttt actttcacca gcgtttctgg gtgagcaaaa acaggaaggc aaaatgccgc 12720 
aaaaaaggga ataagggcga cacggaaatg ttgaatactc atactcttcc tttttcaata 12780 
ttattgaagc atttatcagg gttattgtct catgagcgga tacatatttg aatgtattta 12840 
gaaaaataaa caaatagggg ttccgcgcac atttccccga aaagtgccac ctgacgtcga 12900 
cggatcggga gatctcccga tcccctatgg tcgactctca gtacaatctg ctctgatgcc 12960 
gcatagttaa gccagtatct gctccctgct tgtgtgttgg aggtcgctga gtagtgcgcg 13020 
agcaaaattt aagctacaac aaggcaaggc ttgaccgaca attgcatgaa gaatctgctt 13080 
agggttaggc gttttgcgct gcttcgcgat gtacgggcca gatatacgcg ttgacattga 13140 
ttattgacta gttattaata gtaatcaatt acggggtcat tagttcatag cccatatatg 13200 
gagttccgcg ttacataact tacggtaaat ggcccgcctg gctgaccgcc caacgacccc 13260 
cgcccattga cgtcaataat gacgtatgtt cccatagtaa cgccaatagg gactttccat 13320 
tgacgtcaat gggtggacta tttacggtaa actgcccact c 13361 



<210> 14 
<211> 9569 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 14 

atgcggtttt ggcagtacat caatgggcgt ggatagcggt ttgactcacg gggatttcca 60 
agtctccacc ccattgacgt caatgggagt ttgttttggc accaaaatca acgggacttt 120 
ccaaaatgtc gtaacaactc cgccccattg acgcaaatgg gcggtaggcg tgtacggtgg 180 
gaggtctata taagcagagc tctgtgaaac ttcgaggagt ctctttgttg aggacttttg 240 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 300 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagttgg 360 
cgcccgaaca gggacttgat tgagagtgat tgaggaagtg aagctagagc aatagaaagc 420 
tgttaagcag aactcctgct gacctaaata gggaagcagt agcagacgct gctaacagtg 480 
agtatctcta gtgaagcgga ctcgagctca taatcaagtc attgtttaaa ggcccagata 540 
aattacatct ggtgactctt cgcggacctt caagccagga gattcgccga gggacagtca 600 
acaaggtagg agagattcta cagcaacatg gggaatggac aggggcgaga ttggaaaatg 660 
gccattaaga gatgtagtaa tgttgctgta ggagtagggg ggaagagtaa aaaatttgga 720 
gaagggaatt tcagatgggc cattagaatg gctaatgtat ctacaggacg agaacctggt 780 
gatataccag agactttaga tcaactaagg ttggttattt gcgatttaca agaaagaaga 840 
gaaaaatttg gatctagcaa agaaattgat atggcaattg tgacattaaa agtctttgcg 900 
gtagcaggac ttttaaatat gacgggtgtc tactgctgct gcagctgaaa atatgtattc 960 
tcaaatggga ttagacacta ggccatctat gaaagaagca ggtggaaaag aggaaggccc 1020 
tccacaggca tatcctattc aaacagtaaa tggagtacca caatatgtag cacttgaccc 1080 
aaaaatggtg tccattttta tggaaaaggc aagagaagga ctaggaggtg aggaagttca 1140 
actatggttt actgccttct ctgcaaattt aacacctact gacatggcca cattaataat 1200 
ggccgcacca gggtgcgctg cagataaaga aatattggat gaaagcttaa agcaactgac 1260 
agcagaatat gatcgcacac atccccctga tgctcccaga ccattaccct attttactgc 1320 
agcagaaatt atgggtatag gattaactca agaacaacaa gcagaagcaa gatttgcacc 1380 
agctaggatg cagtgtagag catggtatct cgaggcatta ggaaaattgg ctgccataaa 1440 
agctaagtct cctcgagctg tgcagttaag acaaggagct aaggaagatt attcatcctt 1500 
tatagacaga ttgtttgccc aaatagatca agaacaaaat acagctgaag ttaagttata 1560 
tttaaaacag tcattgagca tagctaatgc taatgcagac tgtaaaaagg caatgagcca 1620 
ccttaagcca gaaagtaccc tagaagaaaa gttgagagct tgtcaagaaa taggctcacc 1680 
aggatataaa atgcaactct tggcagaagc tcttacaaaa gttcaagtag tgcaatcaaa 1740 
aggatcagga ccagtgtgtt ttaattgtaa aaaaccagga catctagcaa gacaatgtag 1800 
agaagtgaaa aaatgtaata aatgtggaaa acctggtcat gtagctgcca aatgttggca 1860 
aggaaataga aagaattgta caagggaaga aagggataca acaattacaa aagtgggaag 1920 
attgggtagg atggatagga aatattccac aatatttaaa gggactattg ggaggtatct 1980 
tgggaatagg attaggagtg ttattattga ttttatgttt acctacattg gttgattgta 2040 
taagaaattg tatccacaag atactaggat acacagtaat tgcaatgcct gaagtagaag 2100 
gagaagaaat acaaccacaa atggaattga ggagaaatgg taggcaatgt ggcatgtctg 2160 
aaaaagagga ggaatgatga agtatctcag acttatttta taagggagat actgtgctga 2220 
gttcttccct ttgaggaagg tatgtcatat gaatccattt cgaatcaaat tccccagcat 2280 
gcctgctatt gtcttcccaa tcctccccct tgctgtcctg ccccacccca ccccccagaa 2340 
tagaatgaca cctactcaga caatgcgatg caatttcctc attttattag gaaaggacag 2400 
tgggagtggc accttccagg gtcaaggaag gcacggggga ggggcaaaca acagatggct 2460 
ggcaactaga aggcacagtc gaggctgatc agcgagctct gggcaacgga gcagtttctg 2520 
cttccagatt ttctagctga tgattagttc tatccactgt attggccgtg gaggggtggt 2580 
ctctagatgt tctgacgaat tcgagtggag ttttgttgct caatgttgga agaggttggg 2640 
atacagaact ctctgaagca tcgtttgaag ttttcatcca gaaatgcata aaggactggg 2700 
ttgaggcagc tgtttgtgta acctagagca atgcagaagt gccaagaaac agtctggaac 2760 
gtagtttctg ggattgtaac caaggcttta atgatgacgt aaatgtgaat gggagtccag 2820 
cagacgatga acacagccac caccaccagc accatcctgg tgatccttcg aagattcctg 2880 
tccttttctt tggagccaga gagcatgcgg acactcttga ggcgcaagat catcagtcca 2940 
tagcacacgg taatgatgag cactggcata atgaaggcga agatgaaaac acagatcttc 3000 
agcaggtttt cccagtcicca ggttggatga gagaatgtta gtgtacaatc tatggaacct 3060 
tgcctgtatt ttgttgtagc catgaacatt acaggaagac caatggctga agagaggatc 3120 
cagttgcaga cattgataat tttggcattt cggggagtac ggaaatctaa ggccttgaca 3180 
gggtggcaga ctgcaatgta tcgatcaaca ctcatggtgc agagggtgaa tatgctggtg 3240 
aacatgttat agtaatctat ggagatcact atcttgcaaa ggatggttcc aaatggccat 3300 
gttcccatta ggtaattcac actctggaag ggcagggtac tggtggctaa ggcatctgcc 3360 
agagcaaggt tgaaaatgta gatgttggtg gcagtcttca tcttggtgta tctgacaatc 3420 
acatacatga ccaggaagtt tccgaagagc cccaccacgc acacgatgga gtagagggcc 3480 
atgatcgtga tggccgtgat catggaggga ctgccggtcg gagggcacag gctgtctctc 3540 
ccgcccaggt cggtgcggtt cggaccgcat gggtcggaca ggttgccatc taagtgggac 3600 
aagttgaccc aggaaccggg gctgggtgct ggggagcaac ttgagtacgc caaggcatca 3660 
gtgcaattgc tggcgttcgt gggggcagcg ctgctgtcca tggtactgac ggccgggcgg 3720 
gcaccgctgt gcgaggtagc caggagcacc gagacttttc gggttccaag cgcctcagcc 3780 
gcttcctttc gccgctgcca cagctcctgc tgtttcttac agaaaccagt cctggctgag 3840 
acagagagga gcgtcaggcg gaggggaccg agctgagcat ctgacattct cctctgcgta 3900 
tagccccctc ccaccttagt agttcacaga ggctcatccg gaattcgata tcaagcttag 3960 
cttggccgcg ggtacaattc cgcagctttt agagcagaag taacacttcc gtacaggcct 4020 
agaagtaaag gcaacatcca ctgaggagca gttctttgat ttgcaccacc accggatccg 4080 
ggacctgaaa taaaagacaa aaagactaaa cttaccagtt aactttctgg tttttcagtt 4140 
cctcgaggac tgcagactca gccggtggcg gtggcggtgg tgacggcggc ggcgcagaga 4200 
gagcaggagt ggcggcggct gcggctccca gcgcgggcgg ggggtgggca gggggcggcg 4260 
cctatagagc cgagcgagcg catgtgaccc gggggccccg atgagtcaag agctgcaggc 4320 
ggagacgcac gtacgagcgc gggtggtggt ggggaaccga acgggtgggg gaagggcccg 4380 
ggcgatccag gaccaagctg gcaaccccat aagtacacgg atgctcataa ccttgtccag 4440 
ccaactcgag ggaagcaggc gtccccttat tccagtatag tggcaggtac taacccagac 4500 
agatggagaa gcccaccaca ggggttcacg aactcgacac agaatataaa aacgcctttt 4560 
tatttgaaag ctgaactgat ccatcacagc ccacctcctc ggctcttctc tccagacaac 4620 
ccctactagc atcttagtcg cctcggatct gcttgagagc tgtgcttagg actttaggta 4680 
ggcccacaga acaagccaca cagagatgac caccacagaa cctgagcctc gggatgtaat 4740 
ggatttccag tctgggacaa aaaggaacat ctcccacttc ttgtaacagg gctatgaatg 4800 
cggtggtccc cactggggta ttatgagttc ttgaggtcag catctytggc ctcctggaaa 4860 
aagagaagac aagtgaactg tgtgccctac cggaatgccc acatccccag gcctctcagt 4920 
ctctctgcct caatctgccc tcctttcgcc tctccacccc gtgtcctgtt atttttagct 4980 
tcagattttt agtgcaggat cttttccctg gcccagctac atcctccata taccaccagc 5040 
cacctccttc ccttcatgcg ctcagggttc atcccttcct cccagtctct accagctatg 5100 
tctgtagaga cagcggtcct ctgcctccct ctctaattct tctcccttct ctggtctctt 5160 
gtgtcttttc ttacctcttc ctgagcctct ctcatctttc cttcatatct gagatacccg 5220 
cagcctctga ctctggcctc tcctcccgcg tgcccagcct ttgacctccg ggggctgcct 5280 
tcctgccatc acctttactg tccctgcagg ctagaggagc cactaagttg gcgggaggta 5340 
cggttccatg tcttggtccg ggtcgagttc ttgctcctgt tcctccttca gcctctgtcg 5400 
gatctcctca gcttctgccc ggtcctcgtc ctcatagtac tctttgtcta ggcgctctag 5460 
gtgtgccatc tgcaccaggg cctcctggcg gtagtcagtc tcatcggcac aggggttgtc 5520 
cagcagcacc aaggccctca gcttgggcag gtcccgcagt ttggccagct cccccaggtc 5580 
actgatcatg ttactcctgt atatgaggtc gggaggggtt aacgcaggaa aatgggctct 5640 
gagtgttagc tacagatcgg gcttcctttg taagaatacc tcctggaagc cactctgggc 5700 
cctggaagca gtgatggatg aggaccagcc ttggcatgat gcaattccct cattcctcct 5760 
ccacccttta gagtccctaa atgccgtgca atgtgttctt ttaaaacttt ggagcagcaa 5820 
aagctaggcc tcatcctgtc ttatatttgg agaagagggg caaatgtccc atacaggagc 5880 
agacattttc tgtgtcagcc ataccaccat agggggttgg aaggattggg cgagcagagg 5940 
aggaaatgga attgaggaga aatggtaggc aatgtggcat gtctgaaaaa gaggaggaat 6000 
gatgaagtat ctcagactta ttttataagg gagatactgt gctgagttct tccctttgag 6060 
gaaggtatgt catatgaatc catttcgaat caaattccta aaaaagaaaa aagggtggac 6120 
tgggatgagt attggaaccc tgaag^aata gaaagaatgc ttatggacta gggactgttt 6180 
acgaacaaat gataaaagga aatagctgag catgactcat agttaaagcg ctagcagctg 6240 
cctaaccgca aaaccacatc ctatggaaag cttgctaatg acgtataagt tgttccattg 6300 
taagagtata taaccagtgc tttgtgaaac ttcgaggagt ctctttgttg aggacttttg 6360 
agttctccct tgaggctccc acagatacaa taaatatttg agattgaacc ctgtcgagta 6420 
tctgtgtaat cttttttacc tgtgaggtct cggaatccgg gccgagaact tcgcagtgac 6480 
cgaccaagcg acgcccaacc tgccatcacg agatttcgat tccaccgccg ccttctatga 6540 
aaggttgggc ttcggaatcg ttttccggga cgccggctgg atgatcctcc agcgcgggga 6600 
tctcatgctg gagttcttcg cccaccccaa cttgtttatt gcagcttata atggttacaa 6660 
ataaagcaat agcatcacaa atttcacaaa taaagcattt ttttcactgc attctagttg 6720 
tggtttgtcc aaactcatca atgtatctta tcatgtctgg atcccgtcga cctcgagagc 6780 
ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 6840 
cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 6900 
ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 6960 
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 7020 
gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 7080 
cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 7140 
tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 7200 
cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 7260 
aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 7320 
cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 7380 
gcgctttctc aatgctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 7440 
ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 7500 
cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 7560 
aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 7620 
tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 7680 
ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 7740 
tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 7800 
ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 7860 
agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 7920 
atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 7980 
cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 8040 
ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 8100 
ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 8160 
agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 8220 
agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc 8280 
gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 8340 
cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 8400 
gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 8460 
tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 8520 
tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aatacgggat 8580 
aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 8640 
cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 8700 
cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 8760 
aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 8820 
ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 8880 
tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 8940 
ccacctgacg tcgacggatc gggagatctc ccgatcccct atggtcgact ctcagtacaa 9000 
tctgctctga tgccgcatag ttaagccagt atctgctccc tgcttgtgtg ttggaggtcg 9060 
ctgagtagtg cgcgagcaaa atttaagcta caacaaggca aggcttgacc gacaattgca 9120 
tgaagaatct gcttagggtt aggcgttttg cgctgcttcg cgatgtacgg gccagatata 9180 
cgcgttgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc 9240 
atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac 9300 
cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa 9360 
tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag 9420 
tacatcaagt gtatcatatg ccaagtacgc cccctattga cgkcaatgac ggtaaatggc 9480 
ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct 9540 
acgtattagt catcgctatt accatggtg 9569 



<210> 15 
<211> 401 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
synthetic construct 
Ser 
Val 


Gly 


Leu 


Phe 
Val 


Asn 
Met 
Thr 
Ile 


Val 


Ile 
Ile 
Tyr 


Tyr 
Leu 


Cys 


Thr 


Met 


Ser 
Arg 
Asn 


Trp 


He 


Leu 


Ser 


Ser 


Ala 






195 










200 


Ala 


Thr 


Thr 


Lys 


Tyr 


Arg 


Gin 


Gly 




210 










215 




Ser 


His 


Pro 


Thr 


Trp 


Tyr 


Trp 


Glu 


225 










230 






He 


Phe 


Ala 


Phe 


He 


Met 


Pro 


He 



245 



Sequence ; /note = 



Asn 


Ala 


Ser 


Asn Cys 


Thr 


Asp 


Pro 




10 








15 




Pro 


Ala 


Pro 


Ser Pro 


Ser 


Ser Trp 


25 








30 






Asn 


Leu 


Ser 


Asp Pro 


Cys 


Gly Pro 








45 








Asp 


Arg 


Leu 


Cys Pro 


Ser 


Ala Gly 








60 








He 


He 


Met 


Ala Leu 


Tyr 


Ser 


He 






75 








80 


Asn 


Phe 


Leu 


Val Met 


Tyr 


Val 


He 




90 








95 




Ala 


Thr 


Asn 


He Tyr 


He 


Phe 


Asn 


105 








110 






Thr 


Ser 


Thr 


Leu Pro 


Phe 


Gin 


Ser 








125 








Pro 


Phe 


Gly 


Thr He 


Leu 


Cys 


Lys 








140 








Asn 


Met 


Phe 


Thr Ser 


He 


Phe 


Thr 






155 








160 


Tyr 


He 


Ala 


Val Cys 


His 


Pro 


Val 




170 








175 




Arg 


Asn 


Ala 


Lys He 


He 


Asn 


He 


185 








190 






He 


Gly 


Leu 


Pro Val 


Met 


Phe 


Met 








205 








Ser 


He 


Asp 


Cys Thr 


Leu 


Thr 


Phe 








220 








Asn 


Leu 


Leu 


Lys He 


Cys 


Val 


Phe 






235 








240 


Leu 


He 


He 


Thr Val 


Cys 


Tyr Gly 




250 








255 
Leu 


Met 


lie Leu Arg 


Leu 


Lys Ser 


Val Arg Met 


Leu Ser 


Gly 


Ser 


Lys 






260 






265 




270 






Glu 


Lys 


Asp Arg Asn 


Leu 


Arg Arg 


lie Thr Arg 


Met Val 


Leu 


Val 


Val 






275 




280 




285 








Val 


Ala 


Val Phe lie 


Val 


Cys Trp 


Thr Pro lie 


His lie 


Tyr 


Val 


lie 




290 






295 




300 








lie 


Lys 


Ala Leu lie 


Thr 


lie Pro 


Glu Thr Thr 


Phe Gin 


Thr 


Val 


Ser 


305 






310 




315 








320 


Trp 


His 


Phe Cys lie 


Ala 


Leu Gly 


lyr Thr Asn 


Ser Cys 


Leu 


Asn 


Pro 






325 






330 






335 




Val 


Leu 


Tyr Ala Phe 


Leu 


Asp Glu 


Asn Phe Lys 


Arg Cys 


Phe 


Arg 


Glu 






340 






345 




350 






Phe 


Cys 


lie Pro Thr 


Ser 


Ser Thr 


lie Glu Gin 


Gin Asn 


Ser 


Thr 


Arg 






355 




360 




365 








lie 


Arg 


Gin Asxi Thr 


Arg 


Asp His 


Pro Ser Thr 


Ala Asn 


Thr 


Val 


Asp 




370 






375 




380 








Arg 


Thr 


Asn His Gin 


Leu 


Glu Asn 


Leu Glu Ala 


Glu Thr 


Thr 


Pro 


Leu 


385 






390 




395 








400 


Pro 





















<210> 16 
<211> 1415 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 16 

gcctgacgct cctctctggc tccgccgggg ttggtcgctg taagaaataa caggagctgt 60 
ggcagcggcg aagacgaagc ggctcgcgcg tggaacccga aaagtcaggg tgctcgcggt 120 
tactcccaac gtggtcccag ccggcggtca gcaccatgga cagcggcgcc gtccccacga 180 
acgccagcaa ctgcactgat cccttcacac acccttcaag ttgctcccca gcacctagtc 240 
ccagctcctg ggtcaacttc tcccacttag aaggcaacct gtccgaccca tgcggtccga 300 
accgcaccga gctgggaggg agcgacagac tgtgcccttc ggccggcagc ccttccatga 360 
tcacggccat catcatcatg gccctctact ccatcgtgtg cgtggtgggg ctcttcggaa 420 
acttcctggt catgtatgtg attgtcaggt acaccaaaat gaagactgcc accaacatct 480 
atattttcaa cctcgccctg gcagatgccc tggcaaccag taccctgcct ttccagagtg 540 
tcaattacct gatgggaaca tggccgtttg gaaccatcct gtgcaagatt gtgatctcca 600 
tagattacta caatatgttc accagcatat tcaccctctg caccatgagt gtggatcgct 660 
acattgcagt ctgccatcct gtcaaggccc tggatttacg cactccccgt aatgccaaga 720 
tcatcaacat ctgcaactgg atcctctctt cagccattgg tctgcctgtg atgttcatgg 780 
caacgacaaa gtaccggcaa ggttccatag attgtacact aacattctct cacccaacgt 840 
ggtactggga aaacctgctg aaaatctgtg ttttcatctt tgccttcatc atgcctatcc 900 
tcatcattac agtgtgttat gggctgatga tcttacgcct caagagtgtc cgcatgctct 960 
ctggctccaa agaaaaggac aggaacctgc gaagaatcac caggatggtg ctggtggttg 1020 
tggctgtgtt cattgtctgc tggacgccca ttcacatcta cgtcatcatt aaagccttga 1080 
tcacaatccc ggaaactact ttccagaccg tttcctggca cttctgcatt gctctaggtt 1140 
ataccaacag ttgcctcaac cccgtccttt atgcatttct ggatgaaaac ttcaaacgat 1200 
gcttcagaga gttctgtatc ccaacttcct ccaccattga gcagcaaaac tccactcgaa 1260 
ttcgtcagaa caccagagac cacccctcca cagccaatac ggtggatagg actaaccatc 1320 
agctagaaaa tctggaagca gaaaccactc cgttacccta actgggtctc ataccattca 1380 
gaccctcact gagcttagac gccacatcta tatga 1415 

<210> 17 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 17 

Met Asp Ser Ser Ala Gly Pro Gly Asn Ile Ser Asp Cys Ser Asp Pro 

1 5 10 15 

Leu Ala Pro Ala Ser Cys Ser Pro Ala Pro Gly Ser Trp Leu Asn Leu 

20 25 30 

Ser His Val Asp Gly Asn Gln Ser Asp Pro Cys Gly Pro Asn Arg Thr 

35 40 45 

Gly Leu Gly Gly Ser His Ser Leu Cys Pro Gln Thr Gly Ser Pro Ser 

50 55 60 

Met Val Thr Ala Ile Thr Ile Met Ala Leu Tyr Ser Ile Val Cys Val 
65 70 75 80 

Val Gly Leu Phe Gly Asn Phe Leu Val Met Tyr Val Ile Val Arg Tyr 

85 90 95 

Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu 

100 105 110 

Ala Asp Ala Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Val Asn Tyr 

115 120 125 

Leu Met Gly Thr Trp Pro Phe Gly Asn Ile Leu Cys Lys Ile Val Ile 

130 135 140 

Ser Ile Asp Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Cys Thr 
145 150 155 160 

Met Ser Val Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu 

165 170 175 

Asp Phe Arg Thr Pro Arg Asn Ala Lys Ile Val Asn Val Cys Asn Trp 

180 185 190 

Ile Leu Ser Ser Ala Ile Gly Leu Pro Val Met Phe Met Ala Thr Thr 

195 200 205 

Lys Tyr Arg Gln Gly Ser Ile Asp Cys Thr Leu Thr Phe Ser His Pro 

210 215 220 

Thr Trp Tyr Trp Glu Asn Leu Leu Lys Ile Cys Val Phe Ile Phe Ala 
225 230 235 240 

Phe Ile Met Pro Val Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Ile 

245 250 255 

Leu Arg Leu Lys Ser Val Arg Met Leu Ser Gly Ser Lys Glu Lys Asp 

260 265 270 

Arg Asn Leu Arg Arg Ile Thr Arg Met Val Leu Val Val Val Ala Val 

275 280 285 

Phe Ile Val Cys Trp Thr Pro Ile His Ile Tyr Val Ile Ile Lys Ala 

290 295 300 

Leu Ile Thr Ile Pro Glu Thr Thr Phe Gln Thr Val Ser Trp His Phe 
305 310 315 320 

Cys Ile Ala Leu Gly Tyr Thr Asn Ser Cys Leu Asn Pro Val Leu Tyr 

325 330 335 

Ala Phe Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Glu Phe Cys Ile 

340 345 350 

Pro Thr Ser Ser Thr Ile Glu Gln Gln Asn Ser Ala Arg Ile Arg Gln 

355 360 365 

Asn Thr Arg Glu His Pro Ser Thr Ala Asn Thr Val Asp Arg Thr Asn 

370 375 380 

His Gln Leu Glu Asn Leu Glu Ala Glu Thr Ala Pro Leu Pro 
385 390 395 



<210> 18 
<211> 2229 
<212> DNA 
<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 18 

cggatcctta gcatccccaa agcgcctccg tgtacttcta aggtgggagg gggatacaag 60 
cagaggagaa tatcggacgc tcagacgttc cattctgcct gccgctcttc tctggttcca 120 
ctagggcttg tccttgtaag aaactgacgg agcctagggc agctgtgaga ggaagaggct 180 
ggggcgcctg gaacccgaac actcttgagt gctctcagtt acagcctacc gagtccgcag 240 
caagcattca gaaccatgga cagcagcgcc ggcccaggga acatcagcga ctgctctgac 300 
cccttagctc ctgcaagttg gtccccagca cctggctcct ggctcaactt gtcccacgtt 360 
gatggcaacc agtccgaccc atgcggtcct aaccgcacgg ggcttggcgg gagccacagc 420 
ctgtgccctc agaccggcag cccttccatg gtcacagcca tcaccatcat ggccctctat 480 
tctatcgtgt gtgtagtggg cctctttgga aacttcctgg tcatgtatgt gattgtaaga 540 
tataccaaaa tgaagactgc caccaacatc tacattttca accttgctct ggcagatgcc 600 
ttagccacta gcacgctgcc ctttcagagt gttaactacc tgatgggaac gtggcccttt 660 
ggaaacatcc tctgcaagat cgtgatctca atagactact acaacatgtt caccagtatc 720 
ttcaccctct gcaccatgag tgtagaccgc tacattgccg tctgccaccc ggtcaaggcc 780 
ctggatttcc gtaccccccg aaatgccaaa attgtcaatg tctgcaactg gatcctctct 840 
tctgccattg gtctgcccgt aatgttcatg gcaaccacaa aatacaggca ggggtccata 900 
gattgcaccc tcactttctc tcatcccaca tggtactggg agaacctgct caaaatctgt 960 
gtcttcatct tcgccttcat catgccggtc ctcatcatca ctgtgtgtta tggactgatg 1020 
atcttacgac tcaagagtgt ccgcatgctg tcgggctcca aagaaaagga caggaacctg 1080 
cgcaggatca cccggatggt gctggtggtc gtggctgtat ttattgtctg ctggaccccc 1140 
atccacatct atgtcatcat caaagcactg atcacgattc cagaaaccac tttccagact 1200 
gtttcctggc acttctgcat tgccttgggt tacacaaaca gctgcctgaa cccagttctt 1260 
tatgcgttcc tggatgaaaa cttcaaacga tgttttagag agttctgcat cccaacttcc 1320 
tccacaatcg aacagcaaaa ctctgctcga atccgtcaaa acactaggga acacccctcc 1380 
acggctaata cagtggatcg aactaaccac cagctagaaa atctggaagc agaaactgct 1440 
ccattgccct aactgggtcc cacgccatcc agaccctcgc taaacttaga ggctgccatc 1500 
tacttggaat caggttgctg tcagggtttg tgggaggctc tggtttcctg gaaaagcatc 1560 
tgatcctgca tcattcaaag tcattcctct ctggctattc acgctacacg tcagagacac 1620 
tcagactgtg tcaagcactc agaaggaaga gactgcaggc cactactgaa tccagctcat 1680 
gtacagaaac atccaatgga ccacaatact ctgtggtatg tgatttgtga tcaacataga 1740 
aggtgaccct tccctatgtg gaatttttaa tttcaaggaa atacttatga tctcatcaag 1800 
ggaaaaatag atgtcacttg ttaaattcac tgtagtgatg cataaaggaa aagctacctc 1860 
tgacctctag cccagtcacc ctctatggaa agttccatag ggaatatgtg agggaaaatg 1920 
ttgcttccaa attaaatttt cacctttatg ttatagtcta gttaagacat caggggcatc 1980 
tctgtttctt ggttttgtat tgtttgaaag aagacatctt cctccctagc tgcgtgttga 2040 
aaatgaaagg gatttaaaac acagtgtcaa ctgcagaata gttgattctc gcactgaagg 2100 
gggggggcta atcttcccaa ttctttccat gtcctccaag tgttcacaag gtcaaactca 2160 
gagtcaccca gtaagctcat catgccacca ttctgagcaa aatccttgga ttcctgctca 2220 
gaatggtgg 2229 

<210> 19 
<211> 398 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 19 

Met Asp Ser Ser Thr Gly Pro Gly Asn Thr Ser Asp Cys Ser Asp Pro 

1 5 10 15 

Leu Ala Gln Ala Ser Cys Ser Pro Ala Pro Gly Ser Trp Leu Asn Leu 
20 25 30 
Ser 


His 


Val 


Asp Gly Asn 


Gin 


Ser 






35 






40 


Gly 


Leu 


Gly 


Gly Asn Asp 


Ser 


Leu 




50 






55 




Met 


Val 


Thr 


Ala Ile Thr 


Ile 


Met 


65 






70 






Val 


Gly 


Leu 


Phe Gly Asn 


Phe 


Leu 








85 






Thr 


Lys 


Met 


Lys Thr Ala 


Thr 


Asn 








100 






Ala 


Asp 


Ala 


Leu Ala Thr 


Ser 


Thr 






115 






120 


Leu 


Met 


Gly 


Thr Trp Pro 


Phe Gly 




130 






135 




Ser 


He 




Tyr Tyr Asn 


Met 


Phe 


145 






150 






Met 


Ser 


Val 


Asp Arg Tyr 


He 


Ala 








165 






Asp 


Phe 


Arg 


Thr Pro Arg 


Asn 


Ala 








180 






He 


Leu 


Ser 


Ser Ala He 


Gly Leu 






195 






200 


Lys 


Tyr 


Arg 


Gin Gly Ser 


He Asp 




210 






215 




Thr 


Trp 


Tyr 


Trp Glu Asn 


Leu 


Leu 


225 






230 






Phe 


He 


Met 


Pro Val Leu 


Ile 


Ile 








245 






Leu 


Arg 


Leu 


Lys Ser Val 


Arg Met 








260 






Arg 


Asn 


Leu 


Arg Arg He 


Thr Arg 






275 






280 


Phe 


He 


Val 


Cys Trp Thr 


Pro 


He 




290 






295 




Leu 


He 


Thr 


He Pro Glu 


Thr 


Thr 


305 






310 






Cys 


He 


Ala 


Leu Gly Tyr 


Thr 


Asn 








325 






Ala 


Phe 


Leu 


Asp Glu Asn 



Phe 


Lys 








340 






Pro 


Thr 


Ser 


Ser Thr He 


Glu 


Gin 






355 






360 


Asn 


Thr 


Arg 


Glu His Pro 


Ser 


Thr 




370 






375 




His 


Gin 


Leu 


Glu Asn Leu 


Glu 


Ala 


385 






390 







Asp Pro 


Cys 


Gly 


Leu 


Asn 


Arg 


Thr 








45 








Cys Pro 


Gin 


Thr 


Gly 


Ser 


Pro 


•9CX 






60 










Al ^ TiPii 




Ov?X 


X xc; 


Val 


y ^ 


Val 

VAX 




75 










80 


Val Met 




Val 


He 


Val 


Ai"cr 
rixy 


xyx 


90 










95 




He Tyr 


He 


Phe 


Asn 


Leu 


Ala 


Leu 


105 








110 






Leu Pro 


Phe 


Gin 


Ser 


Val 


Asn 


xjrx 








125 








Thr He 


Leu 


Cys 


xiy» 


He 


Val 
vax 


He 

xxc 






140 










XUX DCJU 


xxc 


srlXXS 




XJCU 


wyo 






155 










160 


Val Cys 


His 


Pro 


Val 


Lys 


Al a 

AXCl 


XJCU 


170 










175 




Lys He 


Val 


Asn 


Val 




Asn 


lip 


185 








190 






Pro Val 


Met 


Phe 


Met 


Ala 


Tiir 


Thr 








205 








Cys Thr 


Leu 


Thr 


Phe 


Ser 


His 


Pro 






220 










Lys lie 


Cys 


Val 


Glv 


He 


Phe 


Ala 




235 










240 








ni V 


XJC ix 




Tl^ 
xxc 


250 














OCX 


Glv 

V3 JL _y 


■9 CSX 


XI jr 9 


Glu 

wX IX 


Lys 


A cm 


265 








270 






Met Val 


XJv? u 


Val 


Val 

V CLX 


Val 


Ala 

fax A 


Val 

VAX 








285 








His He 


i.yx 


Val 


He 


He 


XI jr 0 


Ala 
nx A 






300 












Thr 


Val 


OCX 




His 


JrXlC 




315 










320 


OCX \^yo 


XiCU 


aSU 


Pro 


Val 
VAX 


Xjeu 


lyr 


7 n 
J J u 














Arg Cys 


Phe 


Arg 


Glu 


Phe 


Cys 


He 


345 








350 






Gin Asn 


Ser 


Thr 


Arg 


Val 


Arg 


Gin 








365 








Ala Asn 


Thr 


Val 


Asp 


Arg 


Thr 


Asn 






380 










Glu Thr 


Ala 


Pro 


Leu 


Pro 







395 



<210> 20 
<211> 1401 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 20 

cacgagcttc tgcctgccgc tcttctctgg ttccactagg gctggtccat gtaagaatct 60 
gacggagcct agggcagctg tgagaggaag aggctggggc gcgtggaacc cgaaaagtct 120 
gagtgctctc agttacagcc tacctagtcc gcagcaggcc ttcagcacca tggacagcag 180 
caccggccca gggaacacca gcgactgctc agacccctta gctcaggcaa gttgctcccc 240 
agcacctggc tcctggctca acttgtccca cgttgatggc aaccagtccg atccatgcgg 300 
tctgaaccgc accgggcttg gcgggaacga cagcctgtgc cctcagaccg gcagcccttc 360 
catggtcaca gccattacca tcatggccct ctactctatc gtgtgtgtag tgggcctctt 420 
cggaaacttc ctggtcatgt atgtgattgt aagatacacc aaaatgaaga ctgccaccaa 480 
catctacatt ttcaaccttg ctctggcaga cgccttagcg accagtacac tgccctttca 540 
gagtgtcaac tacctgatgg gaacatggcc cttcggaacc atcctctgca agatcgtgat 600 
ctcaatagat tactacaaca tgttcaccag catattcacc ctctgcacca tgagcgtgga 660 
ccgctacatt gctgtctgcc acccagtcaa agccctggat ttccgtaccc cccgaaatgc 720 
caaaatcgtc aacgtctgca actggatcct ctcttctgcc atcggtctgc ctgtaatgtt 780 
catggcaacc acaaaataca ggcaggggtc catagattgc accctcacgt tctcccaccc 840 
aacctggtac tgggagaacc tgctcaaaat ctgtgtcttt atcttcgctt tcatcatgcc 900 
ggtcctcatc atcactgtgt gttacggcct gatgatctta cgactcaaga gcgttcgcat 960 
gctatcgggc tccaaagaaa aggacaggaa tctgcgcagg atcacccgga tggtgctggt 1020 
ggtcgtggct gtatttatcg tctgctggac ccccatccac atctacgtca tcatcaaagc 1080 
gctgatcacg attccagaaa ccacatttca gacggtttcc tggcacttct gcattgcttt 1140 
gggttacacg aacagctgcc tgaatccagt tctttacgcc ttcctggatg aaaacttcaa 1200 
gcgatgcttc agagagttct gcatcccaac ctcgtccacg atcgaacagc aaaactccac 1260 
tcgagtccgt cagaacacta gggaacatcc ctccacggct aatacagtgg atcgaactaa 1320 
ccaccagcta gaaaatctgg aggcagaaac tgctccattg ccctaactgg gtctcacacc 1380 
atccagaccc tcgctaagct t 1401 

<210> 21 

<211> 401 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 21 



Met Asp 


Ser 


Ser 


Ala 


Asp 


Pro 


Arg 


Asn 


Ala 


Ser Asn Cys Thr Asp Pro 


1 






5 










10 


15 


Phe Ser 


Pro 


Ser 


Ser 


Met 


Cys 


Ser 


Pro 


Val 


Pro Ser Pro Ser Ser Trp 






20 










25 




30 


Val Asn 


Phe 


Ser 


His 


Leu 


Glu 


Gly 


Asn 


Leu 


Ser Asp Pro Cys Ile Arg 




35 










40 






45 


Asn Arg 


Thr 


Glu 


Leu 


Gly 


Gly 


Ser 


Asp 


Ser 


Leu Cys Pro Pro Thr Gly 


50 










55 








60 


Ser Pro 


Ser 


Met 


Val 


Thr 


Ala 


He 


Thr 


He 


Met Ala Leu Tyr Ser Ile 


65 








70 










75 80 


Val Cys 


Val 


Val 


Gly 


Leu 


Phe 


Gly 


Asn 


Phe 


Leu Val Met Tyr Val Ile 








85 










90 


95 


Val Arg 


Tyr 


Thr 


Lys 


Met 


Lys 


Thr 


Ala 


Thr 


Asn Ile Tyr Ile Phe Asn 






100 










105 




110 


Leu Ala 


Leu 


Ala 


Asp 


Ala 


Leu 


Ala 


Thr 


Ser 


Thr Leu Pro Phe Gln Ser 




115 










120 






125 


Val Asn 


Tyr 


Leu 


Met 


Gly 


Thr 


Trp 


Pro 


Phe 


Gly Thr Ile Leu Cys Lys 


130 










135 








140 


Ile Val 


Ile 


Ser 


He 


Asp 


Tyr 


Tyr 


Asn 


Met 


Phe Thr Ser Ile Phe Thr 


145 








150 










155 160 


Leu Cys Thr 


Met 


Ser 


Val 


Asp 


Arg 


Tyr 


He 


Ala Val Cys His Pro Val 








165 










170 


175 


Lys Ala Leu 


Asp 


Phe 


Arg 


Thr 


Pro 


Arg 


Asn 


Ala Lys Ile Ile Asn Val 






180 










185 




190 


Cys Asn 


Trp 


He 


Leu 


Ser 


Ser 


Ala 


He 


Gly 


Leu Pro Val Met Phe Met 




195 










200 






205 


Ala Thr 


Thr 


Lys 


Tyr 


Arg 


Asn 


Gly 


Ser 


He 


Asp Cys Ala Leu Thr Phe 



210 215 220 
His 


Pro 


Thr Trp Tyr 


xxjj VXLi rUaXl 


Leu 


Leu 


Lys 


He 


Cys Val Phe 


225 










230 






235 






240 


lie 


Phe 


Ala 


Phe 


He 

245 


Met 


*ri>v^ Vd^ JUCU 


He 

250 


He 




Val 


Cys Tyr Gly 
255 


Leu 


Met 


He 


Leu 


Arg 


Leu 


TiVcs Q*»T* Val 


Arg Met 




Ser Gly Ser Lys 








260 






265 










270 


61u 




ASD 


Arg Asn Leu 




Thr Arg 


Met 


Val 


Leu Val Val 






275 








280 








285 




Val 


Ala 


Val 


Phe 


He 


Val 


fvO TtT^ T*!!*!^ 
\0jff9 XUJU 


Pro 


He 




He Tyr Val He 




290 










295 












lie 




Ala 


Leu 


He 


Thr 


T 1 <a Tim a'] n 


Thr 


Thr 




Gin 


Thr Val Ser 


305 










310 






315 






320 


irp 


Hi. 8 




Cys 


He 
325 


Ala 


UCU \3Xy Ajfi^ 


Thr 
330 


Asn 


Ser 


Cys 


Leu Asn Pro 
335 


VCIX 


Lieu 


Tyx 


Ala 
340 


Phe 


Leu 


iiSp IvJLU ABU 

345 


Phe 


Lys 


Arg 


Cys 


Phe Arg Glu 
350 


Phe 


Cys 


lie 


Pro 


Thr 


Ser 


Ser Thr He 


Glu 


Gin 


Gin 


Asn Ser Ala Arg 






355 








360 








365 




He 


Arg 


Gin 


Asn Thr Arg 


Asp His Pro 


Ser 


Thr 


Ala 


Asn 


Thr Val Asp 




370 










375 






380 






Arg 


Thr 


Asn 


His 


Gin 


Leu 


Glu Asn Leu 


Glu 


Ala 


Glu 


Thr 


Ala Pro Leu 


385 










390 






395 






400 


Pro 

























<210> 22
<211> 1881 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 22 

atgagctgtg gtgtacttct aagatgggag ggggcaacaa gcagaagata atgtcagaag 60
cttagctctc cttctgcctg acgctcctct ctggctccgc ctgggttggc ttctgtaaga 120
agtagcagga gccgtggcgg gggctggagg aagcggctga ggcgcgtgga acccgaaaag 180
cccgggtgat cgcggttacc tcactgcgtg gtcccagccg cccagccgtc agcaccatgg 240
acagcagcgc tgacccccga aacgccagca attgcactga tcccttctcg ccctcttcaa 300
tgtgctcccc agtacctagc cccagctcct gggtcaactt ctcccactta gaaggcaacc 360
tgtccgaccc atgcattcgg aaccgcaccg agctgggcgg gagcgacagc ctgtgccctc 420
cgaccggcag tccttccatg gtcacggcca tcaccatcat ggccctctac tccatcgtgt 480
gcgtggtggg tctcttcgga aacttcctgg tcatgtatgt gattgtcaga tacaccaaaa 540
tgaagactgc caccaacatc tatattttca accttgctct ggcggatgcc ttagccacca 600
gtaccctacc cttccagagt gtcaattacc taatgggaac gtggccgttt ggaaccatcc 660
tctgcaagat cgtgatctcc atagattact acaatatgtt caccagcata ttcaccctct 720
gtaccatgag cgtggatcgc tacatcgccg tctgccatcc cgtcaaggcc ctggacttcc 780
gcactccccg caacgccaaa atcatcaacg tctgcaactg gatcctctct tcagccattg 840
gtctgcctgt gatgttcatg gcaacaacaa agtaccggaa tggttccata gattgtgcac 900
taacattctc tcacccaacc tggtactggg aaaacctgct gaaaatctgt gttttcatct 960
ttgccttcat catgcctgtc ctcatcatta cggtgtgtta tgggctgatg atcttacgcc 1020
tcaagagtgt tcgcatgctc tctggctcca aagaaaagga taggaacctg cgaagaatca 1080
ccaggatggt gctggtggtt gtggctgtgt tcattgtctg ctggactccc attcacattt 1140
acgtcatcat taaagccttg attacaattc cagaaactac tttccagact gtgtcctggc 1200
acttctgcat tgctctaggt tatacaaaca gctgcctgaa cccagtcctt tatgcatttc 1260
tggatgaaaa cttcaaacga tgcttcagag agttctgtat cccaacctcc tccaccattg 1320
agcagcaaaa ctccgctcga atccgtcaaa acaccagaga ccacccctcc acggccaaca 1380
cggtggacag gaccaaccat cagctagaaa atctggaagc agaaactgct ccattgccct 1440
aaccaggtgt catgccattc agatcctcaa tgagctaaga cagccaccat ctacgtggaa 1500
gcaggttgcc atgagaatgt gtgggaggca ctattttcct aggaaagtgc ctgctctgag 1560
tcatcaaatc tgtttcctct ctggccgctc tgctctgcac atgagaggga catccaaact 1620 
aaatcaagca ctaggaagga aagaactaat ccacatggag tttgcctgtg cacataatct 1680 
caaggaagat gacccatggg accgaaacat gctgtggtat gtgcgttgag gtcatcctca 1740 
aagatggccc ttctgtatgt aatgtgctgt tttcaagcaa atgtttacgt cctcatcaaa 1800 
gaaaaaatgt cagttgttaa attcaccata gtaacttgta aaggctacct ctgatcgaag 1860 
catcttatgt ggaaatccaa g 1881 

<210> 23 
<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 23 


















Met Glu Pro Ala Pro Ser Ala Gly Ala Glu Leu Gln Pro Pro Leu Phe
1 5 10 15

Ala Asn Ala Ser Asp Ala Tyr Pro Ser Ala Phe Pro Ser Ala Gly Ala
20 25 30

Asn Ala Ser Gly Pro Pro Gly Ala Arg Ser Ala Ser Ser Leu Ala Leu
35 40 45

Ala Ile Ala Ile Thr Ala Leu Tyr Ser Ala Val Cys Ala Val Gly Leu
50 55 60

Leu Gly Asn Val Leu Val Met Phe Gly Ile Val Arg Tyr Thr Lys Met
65 70 75 80

Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala
85 90 95

Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Ala Lys Tyr Leu Met Glu
100 105 110

Thr Trp Pro Phe Gly Glu Leu Leu Cys Lys Ala Val Leu Ser Ile Asp
115 120 125

Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Thr Met Met Ser Val
130 135 140

Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu Asp Phe Arg
145 150 155 160

Thr Pro Ala Lys Ala Lys Leu Ile Asn Ile Cys Ile Trp Val Leu Ala
165 170 175

Ser Gly Val Gly Val Pro Ile Met Val Met Ala Val Thr Arg Pro Arg
180 185 190

Asp Gly Ala Val Val Cys Met Leu Gln Phe Pro Ser Pro Ser Trp Tyr
195 200 205

Trp Asp Thr Val Thr Lys Ile Cys Val Phe Leu Phe Ala Phe Val Val
210 215 220

Pro Ile Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Leu Leu Arg Leu
225 230 235 240

Arg Ser Val Arg Leu Leu Ser Gly Ser Lys Glu Lys Asp Arg Ser Leu
245 250 255

Arg Arg Ile Thr Arg Met Val Leu Val Val Val Gly Ala Phe Val Val
260 265 270

Cys Trp Ala Pro Ile His Ile Phe Val Ile Val Trp Thr Leu Val Asp
275 280 285

Ile Asp Arg Arg Asp Pro Leu Val Val Ala Ala Leu His Leu Cys Ile
290 295 300

Ala Leu Gly Tyr Ala Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe
305 310 315 320

Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Gln Leu Cys Arg Lys Pro
325 330 335

Cys Gly Arg Pro Asp Pro Ser Ser Phe Ser Arg Ala Arg Glu Ala Thr
340 345 350

Ala Arg Glu Arg Val Thr Ala Cys Thr Pro Ser Asp Gly Pro Gly Gly
355 360 365

Gly Ala Ala Ala
370



<210> 24 
<211> 1773 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 

<400> 24 

ccgaggagcc tgcgctgctc ctggctcaca gcgctccggg cgaggagagc gggcggaccg 60 
gggggctggg ccggtgcggg cggcgaggca ggcggacgag gcgcagagac agcggggcgg 120 
ccggggcgcg gcacgcggcg ggtcggggcc ggcctctgcc ttgccgctcc cctcgcgtcg 180 
gatccccgcg cccaggcagc cggtggagag ggacgcggcg gacgccggca gccatggaac 240 
cggccccctc cgccggcgcc gagctgcagc ccccgctctt cgccaacgcc tcggacgcct 300 
accctagcgc cttccccagc gctggcgcca atgcgtcggg gccgccaggc gcgcggagcg 360 
cctcgtccct cgccctggca atcgccatca ccgcgctcta ctcggccgtg tgcgccgtgg 420 
ggctgctggg caacgtgctt gtcatgttcg gcatcgtccg gtacactaag atgaagacgg 480 
ccaccaacat ctacatcttc aacctggcct tagccgatgc gctggccacc agcacgctgc 540 
ctttccagag tgccaagtac ctgatggaga cgtggccctt cggcgagctg ctctgcaagg 600 
ctgtgctctc catcgactac tacaatatgt tcaccagcat cttcacgctc accatgatga 660 
gtgttgaccg ctacatcgct gtctgccacc ctgtcaaggc cctggacttc cgcacgcctg 720 
ccaaggccaa gctgatcaac atctgtatct gggtcctggc ctcaggcgtt ggcgtgccca 780 
tcatggtcat ggctgtgacc cgtccccggg acggggcagt ggtgtgcatg ctccagttcc 840 
ccagccccag ctggtactgg gacacggtga ccaagatctg cgtgttcctc ttcgccttcg 900 
tggtgcccat cctcatcatc accgtgtgct atggcctcat gctgctgcgc ctgcgcagtg 960 
tgcgcctgct gtcgggctcc aaggagaagg accgcagcct gcggcgcatc acgcgcatgg 1020 
tgctggtggt tgtgggcgcc ttcgtggtgt gttgggcgcc catccacatc ttcgtcatcg 1080 
tctggacgct ggtggacatc gaccggcgcg acccgctggt ggtggctgcg ctgcacctgt 1140 
gcatcgcgct gggctacgcc aatagcagcc tcaaccccgt gctctacgct ttcctcgacg 1200 
agaacttcaa gcgctgcttc cgccagctct gccgcaagcc ctgcggccgc ccagacccca 1260 
gcagcttcag ccgcgcccgc gaagccacgg cccgcgagcg tgtcaccgcc tgcaccccgt 1320 
ccgatggtcc cggcggtggc gctgccgcct gaccaggcca tccggccccc agacgcccct 1380 
ccctagttgt acccggaggc cacatgagtc ccagtgggag gcgcgagcca tgatgtggag 1440 
tggggccagt agataggtcg gagggctttg ggaccgccag atggggcctc tgtttcggag 1500 
acgggaccgg gccgctagat gggcatgggg tgggcctctg gtttggggcg aggcagagga 1560 
cagatcaatg gcgcagtgcc tctggtctgg gtgcccccgt ccacggctct aggtggggcg 1620 
ggaaagccag tgactccagg agaggagcgg gacctgtggc tctacaactg agtccttaaa 1680 
cagggcatct ccaggaaggc ggggcttcaa ccttgagaca gcttcggttt ctaacttgga 1740 
gccggacttt cggagttggg gggtccgggg ccc 1773 

<210> 25 
<211> 228 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 25

Gly Ile Val Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile Tyr Ile
1 5 10 15
Phe


Asn 


Leu 


Ala 
20 


Leu 


Ala 


Asp Ala 


Gin 


Ser 


Ala 

35 


Lvs 


Tvr 


Leu 


Met Glu 
40 


Cys 


Lys 
■50 


Ala 


Val 


Leu 


Ser 


lie Asp 
55 


Phe 


Thr 


Leu 


Thr 


Met 


Met 


Ser Val 


65 










70 






Val 

VAX 




Ala 


Leu 
85 


Asp 


Phe Arg 


Ash 


He 


Cys 


He 
100 


Tro 


Val 


Leu Ala 


Val 


Met 


Ala 

115 


Val 


"l^r 


Ara 


Pro Arg 
120 


Gin 


Phe 
130 


Pro 


Ser 


Pro 


Ser 


135 


Val 


•Phe 


Leu 


Phe 


Ala 


Phe 


Val Val 


145 










150 






Glv 


Leu 


Met 


Leu 

165 


Leu 


Ara Leu 


Ser 


Lys 


Glu 


Lys 
IBO 


Asp 


Arg 


Ser Leu 


Val 


Val 


Val 
195 


Qly 


Ala 


Phe 


Val Val 
200 


Val 


He 
210 


Val 


Trp 


Thr 


Leu 


Val Asp 
215 


Val 


Ala 


Ala 


Leu 









225 



<210> 26 

<211> 372 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 26 



Met 


Glu 


Pro 


Val 


Pro 


Ser 


Ala 


Arg 


1 








5 








Ala 


Asn 


Val 


Ser 


Asp 


Thr 


Phe 


Pro 








20 










Asn 


Ala 


Ser 


Gly 


^er 


Pro 


Gly 


Ala 






35 










40 


Ala 


He 


Ala 


He 


Thr 


Ala 


Leu 


Tyr 




50 










55 




Leu 


Gly 


Asn 


Val 


Leu 


Val 


Met 


Phe 


65 










70 






liys 


Thr 


Ala 


Thr 


Asn 


He 


Tyr 


He 










85 








Leu 


Ala 


Thr 


Ser 


Thr 


Leu 


Pro 


Phe 








100 










Thr 


Trp 


Pro 


Phe 


Gly 


Glu 


Leu 


Leu 






115 










120 


Tyr 


Tyr 


Asn 


Met 


Phe 


Thr 


Ser 


He 




130 










135 




Asp 


Arg 


Tyr 


He 


Ala 


Val 


Cys 


His 


145 










150 







Leu Ala 


Thr 


Ser* 


Thr 


Leu 


Pro 


Phe 


25 








30 






Thr Trp 


Pro 


Phe 


Glv 


Glu 


Leu 


Leu 








45 










Asn 


Met 


Phe 


Thr 


Ser 


He 






60 










Asp Arg 


j-yx 


He 


Ala 


Val 


Cys 


His 




75 










80 


XXIX JET X W 


Ala 


xiy B 


Ala 


Lys 


Leu 


He 


90 










95 




Ser Gly 


Val 


Qlv 
vaxy 


Val 


Pro 


He 


Met 


105 








110 








Ala 


Val 

VAX 


Val 


Cys 


Met 


Leu 








125 








Tttb Ann 


XXIX 


Val 

VAX 


A XXX 


xiy B 


He 

xxc 


v*y 9 






140 










Pro He 


Leu 


Val 


He 


Thr 


Val 


Cys 




155 










160 


Arg Ser 


Val 


Arg 


Leu 


Leu 


Ser 


Gly 


170 










175 




Arg Arg 


He 


Thr 


Arg 


Met 


Val 


Leu 


185 








190 






Cys Trp 


Ala 


Pro 


He 


His 


He 


Phe 








205 








He Asp 


Arg 


Arg 


Asp 


Pro 


Leu 


Val 



220 



Sequence : /note = . 



Ala 


Glu 
10 


Leu 


Gin 


Phe 


Ser 


Leu 
15 


Leu 


Ser 


Ala 


Phe 


Pro 


Ser 


Ala 


Ser 


Ala 


25 










30 






Arg 


Ser 


Ala 


Ser 


Ser 
45 


Leu 


Ala 


Leu 


Ser 


Ala 


Val 


Cys 
60 


Ala 


Val 


Gly 


Leu 


Gly 


He 


Val 
75 


Arg 


Tyr 


Thr 


Lys 


Leu 
80 


Phe 


Asn 
90 


Leu 


Ala 


Leu 


Ala 


Asp 
95 


Ala 


Gin 


Ser 


Ala 


Lys 


Tyr 


Leu 


Met 


Glu 


105 










110 






Cys 


Lys 


Ala 


Val 


Leu 
125 


Ser 


He 


Asp 


Phe 


Thr 


Leu 


Thr 

14 0 


Met 


Met 


Ser 


Val 


Pro 


Val 


Lys 
155 


Ala 


Leu 


Asp 


Phe 


Arg 
160 
Thr


Pro 


Ala 


Lys Ala Lys Leu 


He 








165 




Ser 


Glv 


Val 


Gly Val Pro lie 


Met 








180 




Asp 


Glv 


Ala 


Val Val Cys Thr 


Leu 






195 




200 


Tro 


Asp 


Thr 


Val Thr Lys lie 


Cvs 




210 




215 




Pro 


lie 


Leu 


lie He Thr Val 


Cys 


225 






230 






Ser 


Val 


Ar9 Leu Leu Ser 


Gly 








245 








lie 


Thr Atq Met Val 


Leu 








260 






Tina 


Ala 


Pro He His He 


Phe 






275 




280 


lie 


Asn 




Arg Asp Pro lieu 


Val 




290 




295 




Ala 


Leu 


Glv 


Tyr Ala Asn Ser 


Ser 


305 






310 




Ijeu 


Asp 


Glu 


Asn Fhe Lys Arg 


Cvs 








325 




eye 


Gly 


Gly 


Gin Glu Pro Gly 


Ser 








340 




Ala 


Arg 


Glu 


Arg Val Thr Ala 


Cys 






355 




360 


Gly 


Ala 


Ala 


Ala 






370 









Asn 


He 


Cys 


He 


Trp 


Val 


Leu 


Ala 




170 










175 




Val 


Met 


Ala 


Val 


Thr 


Gin 


Pro 


Arg 


185 










190 






Gin 


Phe 


Pro 


Ser 


Pro 


Ser 


Trp 


Tyr 










205 








Val 


Phe 


Leu 


Phe 


Ala 


Phe 


Val 


Val 








220 










Tvx 


Gly 


Leu 


Met 


Leu 


Leu 


Arg 


Leu 






235 










240 


Ser 


Lys 


Glu 


Lys 


Asp 


Arg 


Ser 


Leu 




250 










255 




Val 


Val 


Val 


Glv 


Ala 


Phe 


Val 


Val 


265 










270 






Val 


He 


Val 


Tro 


Thr 


Leu 


Val 


Asn 










285 








Val 


Ala 


Ala 


Leu 


His 


Leu 


Cys 


He 








300 










Leu 


Asn 


Pro 


Val 


Leu 


Tyr 


Ala 


Phe 






315 










320 


Phe 


Arg 


Gin 


Leu 


Cys 


Arg 


Ala 


Pro 




330 










335 




Leu 


Arg 


Arg 


Pro 


Arg 


Gin 


Ala 


Thr 


345 










350 






Thr 


Pro 


Ser 


Asp 


Gly 


Pro 


Gly 


Gly 



365 



<210> 27 

<211> 372 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 27 



Met Glu Leu Val Pro Ser Ala Arg Ala Glu Leu Gln Ser Ser Pro Leu
1 5 10 15

Val Asn Leu Ser Asp Ala Phe Pro Ser Ala Phe Pro Ser Ala Gly Ala
20 25 30

Asn Ala Ser Gly Ser Pro Gly Ala Arg Ser Ala Ser Ser Leu Ala Leu
35 40 45

Ala Ile Ala Ile Thr Ala Leu Tyr Ser Ala Val Cys Ala Val Gly Leu
50 55 60

Leu Gly Asn Val Leu Val Met Phe Gly Ile Val Arg Tyr Thr Lys Leu
65 70 75 80

Lys Thr Ala Thr Asn Ile Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala
85 90 95

Leu Ala Thr Ser Thr Leu Pro Phe Gln Ser Ala Lys Tyr Leu Met Glu
100 105 110

Thr Trp Pro Phe Gly Glu Leu Leu Cys Lys Ala Val Leu Ser Ile Asp
115 120 125

Tyr Tyr Asn Met Phe Thr Ser Ile Phe Thr Leu Thr Met Met Ser Val
130 135 140

Asp Arg Tyr Ile Ala Val Cys His Pro Val Lys Ala Leu Asp Phe Arg
145 150 155 160

Thr Pro Ala Lys Ala Lys Leu Ile Asn Ile Cys Ile Trp Val Leu Ala
165 170 175

Ser Gly Val Gly Val Pro Ile Met Val Met Ala Val Thr Gln Pro Arg
180 185 190

Asp Gly Ala Val Val Cys Met Leu Gln Phe Pro Ser Pro Ser Trp Tyr
195 200 205

Trp Asp Thr Val Thr Lys Ile Cys Val Phe Leu Phe Ala Phe Val Val
210 215 220

Pro Ile Leu Ile Ile Thr Val Cys Tyr Gly Leu Met Leu Leu Arg Leu
225 230 235 240

Arg Ser Val Arg Leu Leu Ser Gly Ser Lys Glu Lys Asp Arg Ser Leu
245 250 255

Arg Arg Ile Thr Arg Met Val Leu Val Val Val Gly Ala Phe Val Val
260 265 270

Cys Trp Ala Pro Ile His Ile Phe Val Ile Val Trp Thr Leu Val Asp
275 280 285

Ile Asn Arg Arg Asp Pro Leu Val Val Ala Ala Leu His Leu Cys Ile
290 295 300

Ala Leu Gly Tyr Ala Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe
305 310 315 320

Leu Asp Glu Asn Phe Lys Arg Cys Phe Arg Gln Leu Cys Arg Thr Pro
325 330 335

Cys Gly Arg Gln Glu Pro Gly Ser Leu Arg Arg Pro Arg Gln Ala Thr
340 345 350

Thr Arg Glu Arg Val Thr Ala Cys Thr Pro Ser Asp Gly Pro Gly Gly
355 360 365

Gly Ala Ala Ala
370



<210> 28 
<211> 2219 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 28 

ctctaaaggc tgggtccctg cgcccagggc gcacggtgga gacggacacg gcggcgccat 60 
ggagctggtg ccctctgccc gtgcggagct gcagtcctcg cccctcgtca acctctcgga 120 
cgcctttccc agcgccttcc ccagcgcggg cgccaatgcg tcggggtcgc cgggagcccg 180 
tagtgcctcg tccctcgccc tagccatcgc catcaccgcg ctctactcgg ctgtgtgcgc 240 
agtggggctt ctgggcaacg tgctcgtcat gtttggcatc gtccggtaca ccaaattgaa 300 
gaccgccacc aacatctaca tcttcaatct ggctttggct gatgcgctgg ccaccagcac 360 
gctgcccttc cagagcgcca agtacttgat ggaaacgtgg ccgtttggcg agctgctgtg 420 
caaggctgtg ctctccattg actactacaa catgttcact agcatcttca ccctcaccat 480 
gatgagcgtg gaccgctaca ttgctgtctg ccatcctgtc aaagccctgg acttccggac 540 
accagccaag gccaagctga tcaatatatg catctgggtc ttggcttcag gtgtcggggt 600 
ccccatcatg gtcatggcag tgacccaacc ccgggatggt gcagtggtat gcatgctcca 660 
gttccccagt cccagctggt actgggacac tgtgaccaag atctgcgtgt tcctctttgc 720 
cttcgtggtg ccgatcctca tcatcacggt gtgctatggc ctcatgctac tgcgcctgcg 780 
cagcgtgcgt ctgctgtccg gttccaagga gaaggaccgc agcctgcggc gcatcacgcg 840 
catggtgctg gtggtggtgg gcgccttcgt ggtgtgctgg gcgcccatcc acatcttcgt 900 
catcgtctgg acgctggtgg acatcaatcg gcgcgaccca cttgtggtgg ccgcactgca 960 
cctgtgcatt gcgctgggct acgccaacag cagcctcaac ccggttctct acgccttcct 1020 
ggacgagaac ttcaagcgct gcttccgcca gctctgtcgc acgccctgcg gccgccaaga 1080 
acccggcagt ctccgtcgtc cccgccaggc caccacgcgt gagcgtgtca ctgcctgcac 1140
cccctccgac ggcccgggcg gtggcgctgc cgcctgacct acccgacctt ccccttaaac 1200 
gcccctccca agtgaagtga tccagaggcc acaccgagct ccctgggagg ctgtggccac 1260 
caccaggaca gctagaattg ggcctgcaca gaggggaggc ctcctgtggg gacggggcct 1320
gagggatcaa aggctccagg ttggaacggt gggggtgagg aagcagagct ggtgattcct 1380 
aaactgtatc cattagtaag gcctctccaa tgggacagag cctccgcctt gagataacat 1440 
cgggttctgg cctttttgaa cacccagctc cagtccaaga cccaaggatt ccagctccag 1500 
gaaccaggag gggcagtgat ggggtcgatg atttggtttg gctgagagtc ccagcatttg 1560 
tgttatgggg aggatctctc atcttagaga agataagggg acagggcatt caggcaaggc 1620 
agcttggggt ttggtcagga gataagcgcc cccttccctt ggggggagga taagtggggg 1680 
atggtcaacg ttggagaaga gtcaaagttc tcaccacctt tctaactact cagctaaact 1740 
cgttgaggct agggccaacg tgacttctct gtagagagga tacaagccgg gcctgatggg 1800 
gcaggcctgt gtaatcccag tcatagtgga ggctgaggct ggaaaattaa ggaccaacag 1860 
cctgggcaat ttagtgtctc aaaataaaat gtaaagaggg ctgggaatgt agctcagtgg 1920 
tagggtgttt gtgtgaggct ctgggatcaa taagacaaaa caaccaacca accaaaaacc 1980 
ttccaaacaa caaaaccaac cctcaaacca aaaaactatg tgggtgtctc tgagtctggt 2040 
ttgaagagaa cccgcagccc tgtatccctg tggggctgtg gacagtgggc agaagcagag 2100 
gctccctgga tcctgaacaa gggccccaaa agcaagttct aaagggaccc ctgaaaccga 2160 
gtaagccttt gtgtcaagaa gtgggagtac aaccagaaag gtggctgagt gctttagag 2219 

<210> 29 
<211> 380 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 29

Met Asp Ser Pro Ile Gln Ile Phe Arg Gly Glu Pro Gly Pro Thr Cys

1 5 10 15

Ala Pro Ser Ala Cys Leu Pro Pro Asn Ser Ser Ala Trp Phe Pro Gly

20 25 30

Trp Ala Glu Pro Asp Ser Asn Gly Ser Ala Gly Ser Glu Asp Ala Gln

35 40 45

Leu Glu Pro Ala His Ile Ser Pro Ala Ile Pro Val Ile Ile Thr Ala

50 55 60

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val
65 70 75 80

Met Phe Val Ile Ile Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile

85 90 95

Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met

100 105 110

Pro Phe Gln Ser Thr Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp

115 120 125

Val Leu Cys Lys Ile Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr

130 135 140

Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val
145 150 155 160

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys

165 170 175

Ile Ile Asn Ile Cys Ile Trp Leu Leu Ser Ser Ser Val Gly Ile Ser

180 185 190

Ala Ile Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Asp Val Ile

195 200 205

Glu Cys Ser Leu Gln Phe Pro Asp Asp Asp Tyr Ser Trp Trp Asp Leu

210 215 220

Phe Met Lys Ile Cys Val Phe Ile Phe Ala Phe Val Ile Pro Val Leu
225 230 235 240

Ile Ile Ile Val Cys Tyr Thr Leu Met Ile Leu Arg Leu Lys Ser Val

245 250 255

Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg Ile
260 265 270
Thr Arg Leu Val Leu Val Val Val Ala Val Phe Val Val Cys Trp Thr
275 280 285

Pro Ile His Ile Phe Ile Leu Val Glu Ala Leu Gly Ser Thr Ser His
290 295 300

Ser Thr Ala Ala Leu Ser Ser Tyr Tyr Phe Cys Ile Ala Leu Gly Tyr
305 310 315 320

Thr Asn Ser Ser Leu Asn Pro Ile Leu Tyr Ala Phe Leu Asp Glu Asn
325 330 335

Phe Lys Arg Cys Phe Arg Asp Phe Cys Phe Pro Leu Lys Met Arg Met
340 345 350

Glu Arg Gln Ser Thr Ser Arg Val Arg Asn Thr Val Gln Asp Pro Ala
355 360 365

Tyr Leu Arg Asp Ile Asp Gly Met Asn Lys Pro Val
370 375 380





<210> 30 
<211> 1154 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 30 

atggactccc cgatccagat cttccgcggg gagccgggcc ctacctgcgc cccgagcgcc 60
tgcctgcccc ccaacagcag cgcctggttt cccggctggg ccgagcccga cagcaacggc 120 
agcgccggct cggaggacgc gcagctggag cccgcgcaca tctccccggc catcccggtc 180 
atcatcacgg cggtctactc cgtagtgttc gtcgtgggct tggtgggcaa ctcgctggtc 240 
atgttcgtga tcatccgata cacaaagatg aagacagcaa ccaacattta catatttaac 300
ctggctttgg cagatgcttt agttactaca accatgccct ttcagagtac ggtctacttg 360
atgaattcct ggccttttgg ggatgtgctg tgcaagatag taatttccat tgattactac 420 
aacatgttca ccagcatctt caccttgacc atgatgagcg tggaccgcta cattgccgtg 480 
tgccaccccg tgaaggcttt ggacttccgc acacccttga aggcaaagat catcaatatc 540 
tgcatctggc tgctgtcgtc atctgttggc atctctgcaa tagtccttgg aggcaccaaa 600
gtcagggaag acgtcgatgt cattgagtgc tccttgcagt tcccagatga tgactactcc 660 
tggtgggacc tcttcatgaa gatctgcgtc ttcatctttg ccttcgtgat ccctgtcctc 720 
atcatcatcg tctgctacac cctgatgatc ctgcgtctca agagcgtccg gctcctttct 780 
ggctcccgag agaaagatcg caacctgcgt aggatcacca gactggtcct ggtggtggtg 840 
gcagtcttcg tcgtctgctg gactcccatt cacatattca tcctggtgga ggctctgggg 900 
agcacctccc acagcacagc tgctctctcc agctattact tctgcatcgc cttaggctat 960 
accaacagta gcctgaatcc cattctctac gcctttcttg atgaaaactt caagcggtgt 1020 
ttccgggact tctgctttcc actgaagatg aggatggagc ggcagagcac tagcagagtc 1080 
cgaaatacag ttcaggatcc tgcttacctg agggacatcg atgggatgaa taaaccagta 1140 
tgactagtcg tgga 1154 

<210> 31 
<211> 380 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence : /note = 
synthetic construct 

<400> 31 

Met Glu Ser Pro Ile Gln Ile Phe Arg Gly Asp Pro Gly Pro Thr Cys
1 5 10 15

Ser Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp Phe Pro Asn
20 25 30

Trp Ala Glu Ser Asp Ser Asn Gly Ser Val Gly Ser Glu Asp Gln Gln
35 40 45

Leu Glu Ser Ala His Ile Ser Pro Ala Ile Pro Val Ile Ile Thr Ala
50 55 60

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val
65 70 75 80

Met Phe Val Ile Ile Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile
85 90 95

Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met
100 105 110

Pro Phe Gln Ser Ala Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp
115 120 125

Val Leu Cys Lys Ile Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr
130 135 140

Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val
145 150 155 160

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys
165 170 175

Ile Ile Asn Ile Cys Ile Trp Leu Leu Ala Ser Ser Val Gly Ile Ser
180 185 190

Ala Ile Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Asp Val Ile
195 200 205

Glu Cys Ser Leu Gln Phe Pro Asp Asp Glu Tyr Ser Trp Trp Asp Leu
210 215 220

Phe Met Lys Ile Cys Val Phe Val Phe Ala Phe Val Ile Pro Val Leu
225 230 235 240

Ile Ile Ile Val Cys Tyr Thr Leu Met Ile Leu Arg Leu Lys Ser Val
245 250 255

Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg Ile
260 265 270

Thr Lys Leu Val Leu Val Val Val Ala Val Phe Ile Ile Cys Trp Thr
275 280 285

Pro Ile His Ile Phe Ile Leu Val Glu Ala Leu Gly Ser Thr Ser His
290 295 300

Ser Thr Ala Ala Leu Ser Ser Tyr Tyr Phe Cys Ile Ala Leu Gly Tyr
305 310 315 320

Thr Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp Glu Asn
325 330 335

Phe Lys Arg Cys Phe Arg Asp Phe Cys Phe Pro Ile Lys Met Arg Met
340 345 350

Glu Arg Gln Ser Thr Asn Arg Val Arg Asn Thr Val Gln Asp Pro Ala
355 360 365

Ser Met Arg Asp Val Gly Gly Met Asn Lys Pro Val
370 375 380



<210> 32
<211> 1410 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct 

<400> 32 

gcgcaccttg ctgatcccaa acaggcagag cttcttccag tcttggaagg cacaaattga 60 
gcatcaggaa cgtggaccca tcagggctga acagctactc aggatctaaa gtggtgactt 120 
ggaaagctga cggtgacttg ggaagggagg tcgccaatca gcgatctgga gctgcagcgc 180 
tcaccatgga gtcccccatt cagatcttcc gaggagatcc aggccctacc tgctctccca 240 
gtgcttgcct tctccccaac agcagctctt ggttccccaa ctgggcageia tccgacagta 300 
atggcagtgt gggctcagag gatcagcagc tggagtccgc gcacatctct ccggccatcc 360
ctgttatcat caccgctgtc tactctgtgg tatttgtggt gggcttagtg ggcaattctc 420 
tggtcatgtt tgtcatcatc cgatacacga agatgaagac cgcaaccaac atctacatat 480 
ttaacctggc tttggcagat gctttggtta ctaccactat gccctttcag agtgctgtct 540 
acttgatgaa ttcttggcct tttggagatg tgctatgcaa gattgtcatt tccattgact 600 
actacaacat gtttaccagc atattcacct tgaccatgat gagtgtggac cgctacattg 660 
ctgtgtgcca ccctgtgaaa gctttggact tccgaacacc tttgaaagca aagatcatca 720 
acatctgcat ttggctcctg gcatcatctg ttggtatatc agcgatagtc cttggaggca 780 
ccaaagtcag ggaagatgtg gatgtcattg aatgctcctt gcagtttcct gatgatgaat 840 
attcctggtg ggatctcttc atgaagatct gtgtcttcgt ctttgccttt gtgatcccag 900 
tcctcatcat cattgtctgc tacaccctga tgatcctgcg cctgaagagt gtccggctcc 960 
tgtctggctc ccgagagaag gaccgaaatc tccgccgcat caccaagctg gtgctggtag 1020 
tagttgcagt cttcatcatc tgttggaccc ccattcacat ctttatcctg gtggaggctc 1080 
tgggaagcac ctcccacagc acagctgccc tctccagcta ttatttctgt attgccttgg 1140 
gttataccaa cagcagcctg aatcctgttc tctatgcctt tctggatgaa aacttcaagc 1200 
ggtgttttag ggacttctgc ttccctatta agatgcgaat ggagcgccag agcaccaata 1260 
gagttagaaa cacagttcag gatcctgctt ccatgagaga tgtgggaggg atgaataagc 1320
cagtatgact agtcgtggaa atgtcttctt attgttctcc aggtagagaa gagttcaatg 1380 
atcttggttt aacccagatt acaactgcag 1410 

<210> 33 

<211> 380

<212> PRT

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct 



<400> 33

Met Glu Ser Pro Ile Gln Ile Phe Arg Gly Glu Pro Gly Pro Thr Cys
1 5 10 15

Ala Pro Ser Ala Cys Leu Leu Pro Asn Ser Ser Ser Trp Phe Pro Asn
20 25 30

Trp Ala Glu Ser Asp Ser Asn Gly Ser Val Gly Ser Glu Asp Gln Gln
35 40 45

Leu Glu Pro Ala His Ile Ser Pro Ala Ile Pro Val Ile Ile Thr Ala
50 55 60

Val Tyr Ser Val Val Phe Val Val Gly Leu Val Gly Asn Ser Leu Val
65 70 75 80

Met Phe Val Ile Ile Arg Tyr Thr Lys Met Lys Thr Ala Thr Asn Ile
85 90 95

Tyr Ile Phe Asn Leu Ala Leu Ala Asp Ala Leu Val Thr Thr Thr Met
100 105 110

Pro Phe Gln Ser Ala Val Tyr Leu Met Asn Ser Trp Pro Phe Gly Asp
115 120 125

Val Leu Cys Lys Ile Val Ile Ser Ile Asp Tyr Tyr Asn Met Phe Thr
130 135 140

Ser Ile Phe Thr Leu Thr Met Met Ser Val Asp Arg Tyr Ile Ala Val
145 150 155 160

Cys His Pro Val Lys Ala Leu Asp Phe Arg Thr Pro Leu Lys Ala Lys
165 170 175

Ile Ile Asn Ile Cys Ile Trp Leu Leu Ala Ser Ser Val Gly Ile Ser
180 185 190

Ala Ile Val Leu Gly Gly Thr Lys Val Arg Glu Asp Val Asp Val Ile
195 200 205

Glu Cys Ser Leu Gln Phe Pro Asp Asp Glu Tyr Ser Trp Trp Asp Leu
210 215 220

Phe Met Lys Ile Cys Val Phe Val Phe Ala Phe Val Ile Pro Val Leu
225 230 235 240

Ile Ile Ile Val Cys Tyr Thr Leu Met Ile Leu Arg Leu Lys Ser Val
245 250 255

Arg Leu Leu Ser Gly Ser Arg Glu Lys Asp Arg Asn Leu Arg Arg Ile
260 265 270

Thr Lys Leu Val Leu Val Val Val Ala Val Phe Ile Ile Cys Trp Thr
275 280 285

Pro Ile His Ile Phe Ile Leu Val Glu Ala Leu Gly Ser Thr Ser His
290 295 300

Ser Thr Ala Val Leu Ser Ser Tyr Tyr Phe Cys Ile Ala Leu Gly Tyr
305 310 315 320

Thr Asn Ser Ser Leu Asn Pro Val Leu Tyr Ala Phe Leu Asp Glu Asn
325 330 335

Phe Lys Arg Cys Phe Arg Asp Phe Tyr Phe Pro Ile Lys Met Arg Met
340 345 350

Glu Arg Gln Ser Thr Asn Arg Val Arg Asn Thr Val Gln Asp Pro Ala
355 360 365

Ser Met Arg Asp Val Gly Gly Met Asn Lys Pro Val
370 375 380





<210> 34 
<211> 2481 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 34 

tccagccctg cctgcacagg caaagtttgt ctctctgggt ccaatctgtc ctctgctcct 60 
gcagcccggc aagtgccatc gctcctcggt ttccagctgc agcactcacc atggagtccc 120 
ccatccagat tttccgcgga gagccaggcc ctacctgtgc tcccagtgct tgcctactcc 180 
ccaacagcag ctcttggttc cccaactggg ccgaatcgga cagcaatggc agtttgggct 240 
ccgaggacca gcagctggag cccgcgcaca tctctccagc catccctgtt atcatcaccg 300 
ctgtctactc tgtggtgttt gtggtgggct tagtgggcaa ttccctggtc atgtttgtca 360 
tcatccgata cacaaagatg aagaccgcaa ccaacatcta catatttaac ctggctttgg 420 
cagatgcttt ggttactacc actatgccct tccagagtgc tgtctacttg atgaattctt 480 
ggccttttgg agatgttctg tgcaagattg tcatttccat tgactactac aacatgttta 540 
ccagcatatt caccttgacc atgatgagtg tggaccgcta cattgccgtg tgccaccctg 600 
tgaaagcttt ggatttccga acacctttga aagcaaagat catcaacatc tgcatttggc 660 
tactggcatc atctgttggt atatcagcga tagtccttgg aggcaccaaa gtcagggaag 720
atgtggatgt cattgaatgc tccttgcagt ttcctgatga tgaatattcc tggtgggacc 780 
tcttcatgaa gatctgtgtc ttcgtctttg cctttgttat ccctgtctta atcatcattg 840 
tctgctacac cctgatgatc ctgcgcttga agagtgtccg gctcctctcg ggctctcgag 900 
agaaggaccg aaatctccgc cggatcacca agctggtgct ggtagtggtt gcagtcttca 960 
tcatctgttg gacccccatc cacatcttta tcctggtcga ggctctaggc agcacctctc 1020 
acagcacagc tgtcctctct agctattact tctgcattgc cttgggttat accaacagca 1080 
gcttgaatcc tgttctctat gcctttcttg atgaaaactt caagcggtgt tttagggact 1140 
tctgcttccc cattaagatg cgaatggagc gccagagcac aaacagagtt agaaacacag 1200 
ttcaggatcc tgcttccatg agggatgtgg gtgggatgaa taagccagta tgactagtca 1260 
tggaaatgtc ttcctattgt tctccgggta gagaagagtt caatgatctt ggtttaaccc 1320 
agattaccac tgcagtctga agaggaaaga tgaggtattc aataacttag ccatgttatg 1380 
caatctaaag gtgcagggca cattagtgat ctaggctgag taggggcagc aagtgtgaag 1440 
aacagagcac atgtcctggc aacaatacac ctctttccta ggacagagga gaaggcaatc 1500 
taacctcaac ccttcgataa acagacagca ctctttcttc tggtcccctt gatttactgc 1560 
acctccatct gcgtggcctt tctgtgtaac atagttccaa agctctagag aagaaaatga 1620 
aagaaaaagt gcatttgatc caaaacttac tgggcaccca accttcgcgt taacacaggc 1680 
aaaaccaact tcgtatacaa tgaggccata ccaagggtca tatccaactc actttctgct 1740 
ggtgatcatg tcttatgtgt gatgagagtg aaacttagga agaacctgga ctcagaccct 1800 
gacactgggg gagagcacca tattgacatt tgtgaaccta tttaaagttg tgggtgttct 1860
cgtctcacag tgtctaatgc cttgaaaaac tacagttgct tcttaaggtc tctggttttt 1920
agcatgctat tcaggaagat aatcttctga gaaaacatga actgatatta aaaggttgaa 1980 
gcttaatacc agcaaagtgt gtgtaatttc atctgtaaat agtggtctgt atataaataa 2040 
ggaccaggtt ttcctgtcca gcctgtacat ttctcaagga tgccgtagac acacccctgg 2100 
aggcatggaa agttcatgct gggatatttt gcttcactat aagctacttt cttgatttgg 2160 
tcttggtgtg atttctacta gattactcaa acattattta ctctaacact gatcataact 2220 
tggtgttaac aattccccaa actttgaatt cattctaaag tgttagcatt gatcaaatct 2280 
actttgtggt agcatctgtt tgtaaacaca cacatattgc cagattctct actcaggtag 2340 
aggaagttgc tttgatcatg tacaccttca aatgttatgg tctggctttc cacagaaagt 2400
ggaattgttt caaaatgcat gctgaaaaag gaaataggat ttgagatggc ttagcacaat 2460 
ttgcatggta ttgagtaaga g 2481 

<210> 35
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 35

tttttccagt tccgtttatc c 21

<210> 36
<211> 20
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 36

tttatcgcca atccacatct 20

<210> 37
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 37

cccatagtaa cgccaatagg 20

<210> 38
<211> 21
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 38

aaatgtgagc gagtaacaac c 21

<210> 39
<211> 20
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 39

accacagtcc atgccatcac 20

<210> 40
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 40

tccaccaccc tgttgctgta 20

<210> 41

<211> 23
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 41

cacctaatac gactcactat agg 23

<210> 42
<211> 18
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 42

cattaaccct cactaaag 18

<210> 43
<211> 35
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 43

atatcgatat cgctagcttt taaaagaaaa ggggg 35

<210> 44
<211> 39
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 44

taatcgatgc taagcaaaat tttgaatttt tgtaatttg 39

<210> 45
<211> 22
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 45

gaattaccta atgggaacat gg 22

<210> 46
<211> 18
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 46

gcagacgatg aacacagc 18

<210> 47
<211> 14
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 47

accacagcaa tcac 14

<210> 48
<211> 10472
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 48

aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca 60
tgccttacaa ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga 120
tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt 180
gccgcattgc agagatattg tatttaagtg cctagctcga tacataaacg ggtctctctg 240
gttagaccag atctgagcct gggagctctc tggctaacta gggaacccac tgcttaagcc 300
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 360
taactagaga tccctcagac ccttttagtc agtgtggaaa atctctagca gtggcgcccg 420
aacagggact tgaaagcgaa agggaaacca gaggagctct ctcgacgcag gactcggctt 480
gctgaagcgc gcacggcaag aggcgagggg cggcgactgg tgagtacgcc aaaaattttg 540
actagcggag gctagaagga gagagatggg tgcgagagcg tcagtattaa gcgggggaga 600
attagatcgc gatgggaaaa aattcggtta aggccagggg gaaagaaaaa atataaatta 660
aaacatatag tatgggcaag cagggagcta gaacgattcg cagttaatcc tggcctgtta 720
gaaacatcag aaggctgtag acaaatactg ggacagctac aaccatccct tcagacagga 780
tcagaagaac ttagatcatt atataataca gtagcaaccc tctattgtgt gcatcaaagg 840
atagagataa aagacaccaa ggaagcttta gacaagatag aggaagagca aaacaaaagt 900
aagaccaccg cacagcaagc ggccgctgat cttcagacct ggaggaggag atatgaggga 960
caattggaga agtgaattat ataaatataa agtagtaaaa attgaaccat taggagtagc 1020
acccaccaag gcaaagagaa gagtggtgca gagagaaaaa agagcagtgg gaataggagc 1080
tttgttcctt gggttcttgg gagcagcagg aagcactatg ggcgcagcgt caatgacgct 1140
gacggtacag gccagacaat tattgtctgg tatagtgcag cagcagaaca atttgctgag 1200
ggctattgag gcgcaacagc atctgttgca actcacagtc tggggcatca agcagctcca 1260
ggcaagaatc ctggctgtgg aaagatacct aaaggatcaa cagctcctgg ggatttgggg 1320
ttgctctgga aaactcattt gcaccactgc tgtgccttgg aatgctagtt ggagtaataa 1380
atctctggaa cagatttgga atcacacgac ctggatggag tgggacagag aaattaacaa 1440
ttacacaagc ttaatacact ccttaattga agaatcgcaa aaccagcaag aaaagaatga 1500
acaagaatta ttggaattag ataaatgggc aagtttgtgg aattggttta acataacaaa 1560
ttggctgtgg tatataaaat tattcataat gatagtagga ggcttggtag gtttaagaat 1620
agtttttgct gtactttcta tagtgaatag agttaggcag ggatattcac cattatcgtt 1680
tcagacccac ctcccaaccc cgaggggacc cgacaggccc gaaggaatag aagaagaagg 1740
tggagagaga gacagagaca gatccattcg attagtgaac ggatctcgac ggtatcgata 1800
agcttgggag ttccgcgtta cataacttac ggtaaatggc ccgcctggct gaccgcccaa 1860
cgacccccgc ccattgacgt caataatgac gtatgttccc atagtaacgc caatagggac 1920
tttccattga cgtcaatggg tggagtattt acggtaaact gcccacttgg cagtacatca 1980
agtgtatcat atgccaagta cgccccctat tgacgtcaat gacggtaaat ggcccgcctg 2040
gcattatgcc cagtacatga ccttatggga ctttcctact tggcagtaca tctacgtatt 2100
agtcatcgct attaccatgg tgatgcggtt ttggcagtac atcaatgggc gtggatagcg 2160
gtttgactca cggggatttc caagtctcca ccccattgac gtcaatggga gtttgttttg 2220
gcaccaaaat caacgggact ttccaaaatg tcgtaacaac tccgccccat tgacgcaaat 2280
gggcggtagg cgtgtacggt gggaggtcta tataagcaga gctcgtttag tgaaccgtca 2340
gatcgcctgg agacgccatc cacgctgttt tgacctccat agaagacacc gactctagag 2400
gatccactag tccagtgtgg tggaattgat cccttcacct aatacgactc actataggct 2460
agctcctcct ctgctcgccc aatccttcca accccctatg gtggtatggc tgacacagaa 2520
aatgtctgct cctgtatggg acatttgccc ctcttctcca aatataagac aggatgaggc 2580
ctagcttttg ctgctccaaa gttttaaaag aacacattgc acggcattta gggactctaa 2640
agggtggagg aggaatgagg gaattgcatc atgccaaggc tggtcctcat ccatcactgc 2700
ttccagggcc cagagtggct tccaggaggt attcttacaa aggaagcccg atctgtagct 2760
aacactcaga gcccattttc ctgcgttaac ccctcccgac ctcatataca ggagtaacat 2820
gatcagtgac ctgggggagc tggccaaact gcgggacctg cccaagctga gggccttggt 2880
gctgctggac aacccctgtg ccgatgagac tgactaccgc caggaggccc tggtgcagat 2940
ggcacaccta gagcgcctag acaaagagta ctatgaggac gaggaccggg cagaagctga 3000
ggagatccga cagaggctga aggaggaaca ggagcaagaa ctcgacccgg accaagacat 3060
ggaaccgtac ctcccgccaa cttagtggct cctctagcct gcagggacag taaaggtgat 3120
ggcaggaagg cagcccccgg aggtcaaagg ctgggcacgc gggaggagag gccagagtca 3180
gaggctgcgg gtatctcaga tatgaaggaa agatgagaga ggctcaggaa gaggtaagaa 3240
aagacacaag agaccagaga agggagaaga attagagagg gaggcagagg accgctgtct 3300
ctacagacat agctggtaga gactgggagg aagggatgaa ccctgagcgc atgaagggaa 3360
ggaggtggct ggtggtatat ggaggatgta gctgggccag ggaaaagatc ctgcactaaa 3420
aatctgaagc taaaaataac aggacacggg gtggagaggc gaaaggaggg cagattgagg 3480
cagagagact gagaggcctg gggatgtggg cattccggta gggcacacag ttcacttgtc 3540
ttctcttttt ccaggaggcc aragatgctg acctcaagaa ctcataatac cccagtgggg 3600
accaccgcat tcatagccct gttacaagaa gtgggagatg ttcctttttg tcccagactg 3660
gaaatccatt acatcccgag gctcaggttc tgtggtggtc atctctgtgt ggcttgttct 3720
gtgggcctac ctaaagtcct aagcacagct ctcaagcaga tccgaggcga ctaagatgct 3780
agtaggggtt gtctggagag aagagccgag gaggtgggct gtgatggatc agttcagctt 3840
tcaaataaaa aggcgttttt atattctgtg tcgagttcgt gaacccctgt ggtgggcttc 3900
tccatctgtc tgggttagta cctgccacta tactggaata aggggacgcc tgcttccctc 3960
gagttggctg gacaaggtta tgagcatccg tgtacttatg gggttgccag cttggtcctg 4020
gatcgcccgg gcccttcccc cacccgttcg gttccccacc accacccgcg ctcgtacgtg 4080
cgtctccgcc tgcagctctt gactcatcgg ggcccccggg tcacatgcgc tcgctcggct 4140
ctataggcgc cgccccctgc ccaccccccg cccgcgctgg gagccgcagc cgccgccact 4200
cctgctctct ctgcgccgcc gccgtcacca ccgccaccgc caccggctga gtctgcagtc 4260
ctcgaggaac tgaaaaacca gaaagttaac tggtaagttt agtctttttg tcttttattt 4320
caggtcccgg atccggtggt ggtgcaaatc aaagaactgc tcctcagtgg atgttgcctt 4380
tacttctagg cctgtacgga agtgttactt ctgctctaaa agctgcggaa ttgtacccgc 4440
ggccaagcta agcttgatat cgaattccgg atgagcctct gtgaactact aaggtgggag 4500
ggggctatac gcagaggaga atgtcagatg ctcagctcgg tcccctccgc ctgacgctcc 4560
tctctgtctc agccaggact ggtttctgta agaaacagca ggagctgtgg cagcggcgaa 4620
aggaagcggc tgaggcgctt ggaacccgaa aagtctcggt gctcctggct acctcgcaca 4680
gcggtgcccg cccggccgtc agtaccatgg acagcagcgc tgcccccacg aacgccagca 4740
attgcactga tgccttggcg tactcaagtt gctccccagc acccagcccc ggttcctggg 4800
tcaacttgtc ccacttagat ggcaacctgt ccgacccatg cggtccgaac cgcaccgacc 4860
tgggcgggag agacagcctg tgccctccga ccggcagtcc ctccatgatc acggccatca 4920
cgatcatggc cctctactcc atcgtgtgcg tggtggggct cttcggaaac ttcctggtca 4980
tgtatgtgat tgtcagatac accaagatga agactgccac caacatctac attttcaacc 5040
ttgctctggc agatgcctta gccaccagta ccctgccctt ccagagtgtg aattacctaa 5100
tgggaacatg gccatttgga accatccttt gcaagatagt gatctccata gattactata 5160
acatgttcac cagcatattc accctctgca ccatgagtgt tgatcgatac attgcagtct 5220
gccaccctgt caaggcctta gatttccgta ctccccgaaa tgccaaaatt atcaatgtct 5280
gcaactggat cctctcttca gccattggtc ttcctgtaat gttcatggct acaacaaaat 5340
acaggcaagg ttccatagat tgtacactaa cattctctca tccaacctgg tactgggaaa 5400
acctgctgaa gatctgtgtt ttcatcttcg ccttcattat gccagtgctc atcattaccg 5460
tgtgctatgg actgatgatc ttgcgcctca agagtgtccg catgctctct ggctccaaag 5520
aaaaggacag gaatcttcga aggatcacca ggatggtgct ggtggtggtg gctgtgttca 5580
tcgtctgctg gactcccatt cacatttacg tcatcattaa agccttggtt acaatcccag 5640
aaactacgtt ccagactgtt tcttggcact tctgcattgc tctaggttac acaaacagct 5700
gcctcaaccc agtcctttat gcatttctgg atgaaaactt caaacgatgc ttcagagagt 5760
tctgtatccc aacctcttcc aacattgagc aacaaaactc cactcgaatt cgtcagaaca 5820
tctagagacc acccctccac ggccaataca gtggatagaa ctaatcatca gctagaaaat 5880
ctggaagcag aaactgctcc gttgcccgtc gacccgggcg gccgcttccc tttagtgagg 5940
gttaatgaag ggctcgagtc tagagggccc gcggttcgaa ggtaagccta tccctaaccc 6000
tctcctcggt ctcgattcta cgcgtaccgg ttagtaatga gtttggaatt aattctgtgg 6060
aatgtgtgtc agttagggtg tggaaagtcc ccaggctccc caggcaggca gaagtatgca 6120
aagcatgcat ctcaattagt cagcaaccag gtgtggaaag tccccaggct ccccagcagg 6180
cagaagtatg caaagcatgc atctcaatta gtcagcaacc atagtcccgc ccctaactcc 6240
gcccatcccg cccctaactc cgcccagttc cgcccattct ccgccccatg gctgactaat 6300
tttttttatt tatgcagagg ccgaggccgc ctctgcctct gagctattcc agaagtagtg 6360
aggaggcttt tttggaggcc taggcttttg caaaaagctc ccgggagctt gtatatccat 6420
tttcggatct gatcagcacg tgttgacaat taatcatcgg catagtatat cggcatagta 6480
taatacgaca aggtgaggaa ctaaaccatg gccaagcctt tgtctcaaga agaatccacc 6540
ctcattgaaa gagcaacggc tacaatcaac agcatcccca tctctgaaga ctacagcgtc 6600
gccagcgcag ctctctctag cgacggccgc atcttcactg gtgtcaatgt atatcatttt 6660
actgggggac cttgtgcaga actcgtggtg ctgggcactg ctgctgctgc ggcagctggc 6720
aacctgactt gtatcgtcgc gatcggaaat gagaacaggg gcatcttgag cccctgcgga 6780
cggtgccgac aggtgcttct cgatctgcat cctgggatca aagccatagt gaaggacagt 6840
gatggacagc cgacggcagt tgggattcgt gaattgctgc cctctggtta tgtgtgggag 6900
ggctaagcac aattcgagct cggtaccttt aagaccaatg acttacaagg cagctgtaga 6960
tcttagccac tttttaaaag aaaagggggg actggaaggg ctaattcact cccaacgaag 7020
acaagatctg ctttttgctt gtactgggtc tctctggtta gaccagatct gagcctggga 7080
gctctctggc taactaggga acccactgct taagcctcaa taaagcttgc cttgagtgct 7140
tcaagtagtg tgtgcccgtc tgttgtgtga ctctggtaac tagagatccc tcagaccctt 7200
ttagtcagtg tggaaaatct ctagcagtag tagttcatgt catcttatta ttcagtattt 7260
ataacttgca aagaaatgaa tatcagagag tgagaggaac ttgtttattg cagcttataa 7320
tggttacaaa taaagcaata gcatcacaaa tttcacaaat aaagcatttt tttcactgca 7380
ttctagttgt ggtttgtcca aactcatcaa tgtatcttat catgtctggc tctagctatc 7440
ccgcccctaa ctccgcccat cccgccccta actccgccca gttccgccca ttctccgccc 7500
catggctgac taattttttt tatttatgca gaggccgagg ccgcctcggc ctctgagcta 7560
ttccagaagt agtgaggagg cttttttgga ggcctaggga cgtacccaat tcgccctata 7620
gtgagtcgta ttacgcgcgc tcactggccg tcgttttaca acgtcgtgac tgggaaaacc 7680
ctggcgttac ccaacttaat cgccttgcag cacatccccc tttcgccagc tggcgtaata 7740
gcgaagaggc ccgcaccgat cgcccttccc aacagttgcg cagcctgaat ggcgaatggg 7800
acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg 7860
ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca 7920
cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta 7980
gtgctttacg gcacctcgac cccaaaaaac ttgattaggg tgatggttca cgtagtgggc 8040
catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg 8100
gactcttgtt ccaaactgga acaacactca accctatctc ggtctattct tttgatttat 8160
aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta 8220
acgcgaattt taacaaaata ttaacgctta caatttaggt ggcacttttc ggggaaatgt 8280
gcgcggaacc cctatttgtt tatttttcta aatacattca aatatgtatc cgctcatgag 8340
acaataaccc tgataaatgc ttcaataata ttgaaaaagg aagagtatga gtattcaaca 8400
tttccgtgtc gcccttattc ccttttttgc ggcattttgc cttcctgttt ttgctcaccc 8460
agaaacgctg gtgaaagtaa aagatgctga agatcagttg ggtgcacgag tgggttacat 8520
cgaactggat ctcaacagcg gtaagatcct tgagagtttt cgccccgaag aacgttttcc 8580
aatgatgagc acttttaaag ttctgctatg tggcgcggta ttatcccgta ttgacgccgg 8640
gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat gacttggttg agtactcacc 8700
agtcacagaa aagcatctta cggatggcat gacagtaaga gaattatgca gtgctgccat 8760
aaccatgagt gataacactg cggccaactt acttctgaca acgatcggag gaccgaagga 8820
gctaaccgct tttttgcaca acatggggga tcatgtaact cgccttgatc gttgggaacc 8880
ggagctgaat gaagccatac caaacgacga gcgtgacacc acgatgcctg tagcaatggc 8940
aacaacgttg cgcaaactat taactggcga actacttact ctagcttccc ggcaacaatt 9000
aatagactgg atggaggcgg ataaagttgc aggaccactt ctgcgctcgg cccttccggc 9060
tggctggttt attgctgata aatctggagc cggtgagcgt gggtctcgcg gtatcattgc 9120
agcactgggg ccagatggta agccctcccg tatcgtagtt atctacacga cggggagtca 9180
ggcaactatg gatgaacgaa atagacagat cgctgagata ggtgcctcac tgattaagca 9240
ttggtaactg tcagaccaag tttactcata tatactttag attgatttaa aacttcattt 9300
ttaatttaaa aggatctagg tgaagatcct ttttgataat ctcatgacca aaatccctta 9360
acgtgagttt tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg 9420
agatcctttt tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc 9480
ggtggtttgt ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag 9540
cagagcgcag ataccaaata ctgttcttct agtgtagccg tagttaggcc accacttcaa 9600
gaactctgta gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc 9660
cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc 9720
gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta 9780
caccgaactg agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag 9840
aaaggcggac aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct 9900
tccaggggga aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga 9960
gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc 10020
ggccttttta cggttcctgg ccttttgctg gccttttgct cacatgttct ttcctgcgtt 10080
atcccctgat tctgtggata accgtattac cgcctttgag tgagctgata ccgctcgccg 10140
cagccgaacg accgagcgca gcgagtcagt gagcgaggaa gcggaagagc gcccaatacg 10200
caaaccgcct ctccccgcgc gttggccgat tcattaatgc agctggcacg acaggtttcc 10260
cgactggaaa gcgggcagtg agcgcaacgc aattaatgtg agttagctca ctcattaggc 10320
accccaggct ttacacttta tgcttccggc tcgtatgttg tgtggaattg tgagcggata 10380
acaatttcac acaggaaaca gctatgacca tgattacgcc aagcgcgcaa ttaaccctca 10440
ctaaagggaa caaaagctgg agctgcaagc tt 10472
synthetic construct
<400> 49

ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc 60
caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 120
ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 180
cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca 240
tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 300
cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 360
acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 420
ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 480
tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 540
tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 600
ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 660
ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata 720
gaagacaccg ggaccgatcc agcctcccct cgaagcttac atgtggtacc gagctcggat 780
cctgagaact tcagggtgag tctatgggac ccttgatgtt ttctttcccc ttcttttcta 840
tggttaagtt catgtcatag gaaggggaga agtaacaggg tacacatatt gaccaaatca 900
gggtaatttt gcatttgtaa ttttaaaaaa tgctttcttc ttttaatata cttttttgtt 960
tatcttattt ctaatacttt ccctaatctc tttctttcag ggcaataatg atacaatgta 1020
tcatgcctct ttgcaccatt ctaaagaata acagtgataa tttctgggtt aaggcaatag 1080
caatatttct gcatataaat atttctgcat ataaattgta actgatgtaa gaggtttcat 1140
attgctaata gcagctacaa tccagctacc attctgcttt tattttatgg ttgggataag 1200
gctggattat tctgagtcca agctaggccc ttttgctaat catgttcata cctcttatct 1260
tcctcccaca gctcctgggc aacgtgctgg tctgtgtgct ggcccatcac tttggcaaag 1320
cacgtgagat ctgaattcga gatctgccgc cgccatgggt gcgagagcgt cagtattaag 1380
cgggggagaa ttagatcgat gggaaaaaat tcggttaagg ccagggggaa agaaaaaata 1440
taaattaaaa catatagtat gggcaagcag ggagctagaa cgattcgcag ttaatcctgg 1500
cctgttagaa acatcagaag gctgtagaca aatactggga cagctacaac catcccttca 1560
gacaggatca gaagaactta gatcattata taatacagta gcaaccctct attgtgtgca 1620
tcaaaggata gagataaaag acaccaagga agctttagac aagatagagg aagagcaaaa 1680
caaaagtaag aaaaaagcac agcaagcagc agctgacaca ggacacagca atcaggtcag 1740
ccaaaattac cctatagtgc agaacatcca ggggcaaatg gtacatcagg ccatatcacc 1800
tagaacttta aatgcatggg taaaagtagt agaagagaag gctttcagcc cagaagtgat 1860
acccatgttt tcagcattat cagaaggagc caccccacaa gatttaaaca ccatgctaaa 1920
cacagtgggg ggacatcaag cagccatgca aatgttaaaa gagaccatca atgaggaagc 1980
tgcagaatgg gatagagtgc atccagtgca tgcagggcct attgcaccag gccagatgag 2040
agaaccaagg ggaagtgaca tagcaggaac tactagtacc cttcaggaac aaataggatg 2100
gatgacacat aatccaccta tcccagtagg agaaatctat aaaagatgga taatcctggg 2160
attaaataaa atagtaagaa tgtatagccc taccagcatt ctggacataa gacaaggacc 2220
aaaggaaccc tttagagact atgtagaccg attctataaa actctaagag ccgagcaagc 2280
ttcacaagag gtaaaaaatt ggatgacaga aaccttgttg gtccaaaatg cgaacccaga 2340
ttgtaagact attttaaaag cattgggacc aggagcgaca ctagaagaaa tgatgacagc 2400
atgtcaggga gtggggggac ccggccataa agcaagagtt ttggctgaag caatgagcca 2460
agtaacaaat ccagctacca taatgataca gaaaggcaat tttaggaacc aaagaaagac 2520
tgttaagtgt ttcaattgtg gcaaagaagg gcacatagcc aaaaattgca gggcccctag 2580
gaaaaagggc tgttggaaat gtggaaagga aggacaccaa atgaaagatt gtactgagag 2640
acaggctaat tttttaggga agatctggcc ttcccacaag ggaaggccag ggaattttct 2700
tcagagcaga ccagagccaa cagccccacc agaagagagc ttcaggtttg gggaagagac 2760
aacaactccc tctcagaagc aggagccgat agacaaggaa ctgtatcctt tagcttccct 2820
cagatcactc tttggcagcg acccctcgtc acaataaaga taggggggca attaaaggaa 2880
gctctattag atacaggagc agatgataca gtattagaag aaatgaattt gccaggaaga 2940
tggaaaccaa aaatgatagg gggaattgga ggttttatca aagtaagaca gtatgatcag 3000
atactcatag aaatctgcgg acataaagct ataggtacag tattagtagg acctacacct 3060
gtcaacataa ttggaagaaa tctgttgact cagattggct gcactttaaa ttttcccatt 3120
agtcctattg agactgtacc agtaaaatta aagccaggaa tggatggccc aaaagttaaa 3180
caatggccat tgacagaaga aaaaataaaa gcattagtag aaatttgtac agaaatggaa 3240
aaggaaggaa aaatttcaaa aattgggcct gaaaatccat acaatactcc agtatttgcc 3300
ataaagaaaa aagacagtac taaatggaga aaattagtag atttcagaga acttaataag 3360
agaactcaag atttctggga agttcaatta ggaataccac atcctgcagg gttaaaacag 3420
aaaaaatcag taacagtact ggatgtgggc gatgcatatt tttcagttcc cttagataaa 3480
gacttcagga agtatactgc atttaccata cctagtataa acaatgagac accagggatt 3540
agatatcagt acaatgtgct tccacaggga tggaaaggat caccagcaat attccagtgt 3600
agcatgacaa aaatcttaga gccttttaga aaacaaaatc cagacatagt catctatcaa 3660
tacatggatg atttgtatgt aggatctgac ttagaaatag ggcagcatag aacaaaaata 3720
gaggaactga gacaacatct gttgaggtgg ggatttacca caccagacaa aaaacatcag 3780
aaagaacctc cattcctttg gatgggttat gaactccatc ctgataaatg gacagtacag 3840
cctatagtgc tgccagaaaa ggacagctgg actgtcaatg acatacagaa attagtggga 3900
aaattgaatt gggcaagtca gatttatgca gggattaaag taaggcaatt atgtaaactt 3960
cttaggggaa ccaaagcact aacagaagta gtaccactaa cagaagaagc agagctagaa 4020
ctggcagaaa acagggagat tctaaaagaa ccggtacatg gagtgtatta tgacccatca 4080
aaagacttaa tagcagaaat acagaagcag gggcaaggcc aatggacata tcaaatttat 4140
caagagccat ttaaaaatct gaaaacagga aagtatgcaa gaatgaaggg tgcccacact 4200
aatgatgtga aacaattaac agaggcagta caaaaaatag ccacagaaag catagtaata 4260
tggggaaaga ctcctaaatt taaattaccc atacaaaagg aaacatggga agcatggtgg 4320
acagagtatt ggcaagccac ctggattcct gagtgggagt ttgtcaatac ccctccctta 4380
gtgaagttat ggtaccagtt agagaaagaa cccataatag gagcagaaac tttctatgta 4440
gatggggcag ccaataggga aactaaatta ggaaaagcag gatatgtaac tgacagagga 4500
agacaaaaag ttgtccccct aacggacaca acaaatcaga agactgagtt acaagcaatt 4560
catctagctt tgcaggattc gggattagaa gtaaacatag tgacagactc acaatatgca 4620
ttgggaatca ttcaagcaca accagataag agtgaatcag agttagtcag tcaaataata 4680
gagcagttaa taaaaaagga aaaagtctac ctggcatggg taccagcaca caaaggaatt 4740
ggaggaaatg aacaagtaga taaattggtc agtgctggaa tcaggaaagt actattttta 4800
gatggaatag ataaggccca agaagaacat gagaaatatc acagtaattg gagagcaatg 4860
gctagtgatt ttaacctacc acctgtagta gcaaaagaaa tagtagccag ctgtgataaa 4920
tgtcagctaa aaggggaagc catgcatgga caagtagact gtagcccagg aatatggcag 4980
ctagattgta cacatttaga aggaaaagtt atcttggtag cagttcatgt agccagtgga 5040
tatatagaag cagaagtaat tccagcagag acagggcaag aaacagcata cttcctctta 5100
aaattagcag gaagatggcc agtaaaaaca gtacatacag acaatggcag caatttcacc 5160
agtactacag ttaaggccgc ctgttggtgg gcggggatca agcaggaatt tggcattccc 5220
tacaatcccc aaagtcaagg agtaatagaa tctatgaata aagaattaaa gaaaattata 5280
ggacaggtaa gagatcaggc tgaacatctt aagacagcag tacaaatggc agtattcatc 5340
cacaatttta aaagaaaagg ggggattggg gggtacagtg caggggaaag aatagtagac 5400
ataatagcaa cagacataca aactaaagaa ttacaaaaac aaattacaaa aattcaaaat 5460
tttcgggttt attacaggga cagcagagat ccagtttgga aaggaccagc aaagctcctc 5520
tggaaaggtg aaggggcagt agtaatacaa gataatagtg acataaaagt agtgccaaga 5580
agaaaagcaa agatcatcag ggattatgga aaacagatgg caggtgatga ttgtgtggca 5640
agtagacagg atgaggatta acacatggaa ttccggagcg gccgcaggag ctttgttcct 5700
tgggttcttg ggagcagcag gaagcactat gggcgcagcg tcaatgacgc tgacggtaca 5760
ggccagacaa ttattgtctg gtatagtgca gcagcagaac aatttgctga gggctattga 5820
ggcgcaacag catctgttgc aactcacagt ctggggcatc aagcagctcc aggcaagaat 5880
cctggctgtg gaaagatacc taaaggatca acagctcctg gggatttggg gttgctctgg 5940
aaaactcatt tgcaccactg ctgtgccttg gaatgctagt tggagtaata aatctctgga 6000
acagatttgg aatcacacga cctggatgga gtgggacaga gaaattaaca attacacaag 6060
cttccgcgga attcacccca ccagtgcagg ctgcctatca gaaagtggtg gctggtgtgg 6120
ctaatgccct ggcccacaag tatcactaag ctcgctttct tgctgtccaa tttctattaa 6180
aggttccttt gttccctaag tccaactact aaactggggg atattatgaa gggccttgag 6240
catctggatt ctgcctaata aaaaacattt attttcattg caatgatgta tttaaattat 6300
ttctgaatat tttactaaaa agggaatgtg ggaggtcagt gcatttaaaa cataaagaaa 6360
tgaagagcta gttcaaacct tgggaaaata cactatatct taaactccat gaaagaaggt 6420
gaggctgcaa acagctaatg cacattggca acagcccctg atgcctatgc cttattcatc 6480
cctcagaaaa ggattcaagt agaggcttga tttggaggtt aaagttttgc tatgctgtat 6540
tttacattac ttattgtttt agctgtcctc atgaatgtct tttcactacc catttgctta 6600
tcctgcatct ctcagccttg actccactca gttctcttgc ttagagatac cacctttccc 6660
ctgaagtgtt ccttccatgt tttacggcga gatggtttct cctcgcctgg ccactcagcc 6720
ttagttgtct ctgttgtctt atagaggtct acttgaagaa ggaaaaacag ggggcatggt 6780
ttgactgtcc tgtgagccct tcttccctgc ctcccccact cacagtgacc cggaatccct 6840
cgacatggca gtctagcact agtgcggccg cagatctgct tcctcgctca ctgactcgct 6900
gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt 6960
atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc 7020
caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga 7080
gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata 7140
ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac 7200
cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg 7260
taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc 7320
cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag 7380
acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt 7440
aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaagaacagt 7500
atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg 7560
atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac 7620
gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca 7680
gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac 7740
ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac 7800
ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt 7860
tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt 7920
accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt 7980
atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc 8040
cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa 8100
tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg 8160
tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt 8220
gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc 8280
agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt 8340
aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg 8400
gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac 8460
tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc 8520
gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt 8580
tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg 8640
aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag 8700
catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa 8760
acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacggga tcccctgagg 8820
gggcccccat gggctagagg atccggcctc ggcctctgca taaataaaaa aaattagtca 8880
gccatgagc 8889



<210> 50
<211> 4180
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence : /note =
synthetic construct

<400> 50

aatgtagtct tatgcaatac tcttgtagtc ttgcaacatg gtaacgatga gttagcaaca 60 
tgccttacaa ggagagaaaa agcaccgtgc atgccgattg gtggaagtaa ggtggtacga 120 
tcgtgcctta ttaggaaggc aacagacggg tctgacatgg attggacgaa ccactgaatt 180 
ccgcattgca gagatattgt atttaagtgc ctagctcgat acaataaacg ccatttgacc 240 
attcaccaca ttggtgtgca cctccaagct cgagctcgtt tagtgaaccg tcagatcgcc 300 
tggagacgcc atccacgctg ttttgacctc catagaagac accgggaccg atccagcctc 360 
ccctcgaagc tagtcgatta ggcatctcct atggcaggaa gaagcggaga cagcgacgaa 420 
gacctcctca aggcagtcag actcatcaag tttctctatc aaagcaaccc acctcccaat 480 
cccgagggga cccgacaggc ccgaaggaat agaagaagaa ggtggagaga gagacagaga 540 
cagatccatt cgattagtga acggatcctt agcacttatc tgggacgatc tgcggagcct 600 
gtgcctcttc agctaccacc gcttgagaga cttactcttg attgtaacga ggattgtgga 660 
acttctggga cgcagggggt gggaagccct caaatattgg tggaatctcc tacaatattg 720 
gagtcaggag ctaaagaata gtgctgttag cttgctcaat gccacagcta tagcagtagc 780 
tgaggggaca gatagggtta tagaagtagt acaagaagct tggcactggc cgtcgtttta 840 
caacgtcgtg atctgagcct gggagatctc tggctaacta gggaacccac tgcttaagcc 900 
tcaataaagc ttgccttgag tgcttcaagt agtgtgtgcc cgtctgttgt gtgactctgg 960 
taactagaga tcaggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc 1020 
ctttcgccag ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc 1080 
gcagcctgaa tggcgaatgg cgcctgatgc ggtattttct ccttacgcat ctgtgcggta 1140 
tttcacaccg catacgtcaa agcaaccata gtacgcgccc tgtagcggcg cattaagcgc 1200 
ggcgggtgtg gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc 1260 
tcctttcgct ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct 1320 
aaatcggggg ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa 1380 
acttgatttg ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc 1440 
tttgacgttg gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact 1500 
caaccctatc tcgggctatt cttttgattt ataagggatt ttgccgattt cggcctattg 1560 
gttaaaaaat gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt 1620 
tacaatttta tggtgcactc tcagtacaat ctgctctgat gccgcatagt taagccagcc 1680 
ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc 1740 
ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt caccgtcatc 1800 
accgaaacgc gcgagacgaa agggcctcgt gatacgccta tttttatagg ttaatgtcat 1860 
gataataatg gtttcttaga cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc 1920 
tatttgttta tttttctaaa tacattcaaa tatgtatccg ctcatgagac aataaccctg 1980 
ataaatgctt caataatatt gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc 2040 
ccttattccc ttttttgcgg cattttgcct tcctgttttt gctcacccag aaacgctggt 2100 
gaaagtaaaa gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 2160 
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac 2220 
ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc aagagcaact 2280 
cggtcgccgc atacactatt ctcagaatga cttggttgag tactcaccag tcacagaaaa 2340 
gcatcttacg gatggcatga cagtaagaga attatgcagt gctgccataa ccatgagtga 2400 
taacactgcg gccaacttac ttctgacaac gatcggagga ccgaaggagc taaccgcttt 2460 
tttgcacaac atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 2520 
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg 2580 
caaactatta actggcgaac tacttactct agcttcccgg caacaattaa tagactggat 2640 
ggaggcggat aaagttgcag gaccacttct gcgctcggcc cttccggctg gctggtttat 2700 
tgctgataaa tctggagccg gtgagcgtgg gtctcgcggt atcattgcag cactggggcc 2760 
agatggtaag ccctcccgta tcgtagttat ctacacgacg gggagtcagg caactatgga 2820 
tgaacgaaat agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 2880 
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag 2940 
gatctaggtg aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc 3000 
gttccactga gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt 3060 
tctgcgcgta atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt 3120 
gccggatcaa gagctaccaa ctctttttcc gaaggtaact ggcttcagca gagcgcagat 3180 
accaaatact gttcttctag tgtagccgta gttaggccac cacttcaaga actctgtagc 3240 
accgcctaca tacctcgctc tgctaatcct gttaccagtg gctgctgcca gtggcgataa 3300 
gtcgtgtctt accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg 3360 
ctgaacgggg ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag 3420 
atacctacag cgtgagctat gagaaagcgc cacgcttccc gaagggagaa aggcggacag 3480 
gtatccggta agcggcaggg tcggaacagg agagcgcacg agggagcttc cagggggaaa 3540 
cgcctggtat ctttatagtc ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt 3600 
gtgatgctcg tcaggggggc ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg 3660 
gttcctggcc ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc 3720 
tgtggataac cgtattaccg cctttgagtg agctgatacc gctcgccgca gccgaacgac 3780 
cgagcgcagc gagtcagtga gcgaggaagc ggaagagcgc ccaatacgca aaccgcctct 3840 
ccccgcgcgt tggccgattc attaatgcag ctggcacgac aggtttcccg actggaaagc 3900 
gggcagtgag cgcaacgcaa ttaatgtgag ttagctcact cattaggcac cccaggcttt 3960 
acactttatg cttccggctc gtatgttgtg tggaattgtg agcggataac aatttcacac 4020 
aggaaacagc tatgacatga ttacgaattc gatgtacggg ccagatatac gcgtatctga 4080 
ggggactagg gtgtgtttag gcgaaaagcg gggcttcggt tgtacgcggt taggagtccc 4140 
ctcaggatat agtagtttcg cttttgcata gggaggggga 4180 

<210> 51 
<211> 5821 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 
<400> 51 

ttggcccatt gcatacgttg tatccatatc ataatatgta catttatatt ggctcatgtc 60 
caacattacc gccatgttga cattgattat tgactagtta ttaatagtaa tcaattacgg 120 
ggtcattagt tcatagccca tatatggagt tccgcgttac ataacttacg gtaaatggcc 180 
cgcctggctg accgcccaac gacccccgcc cattgacgtc aataatgacg tatgttccca 240 
tagtaacgcc aatagggact ttccattgac gtcaatgggt ggagtattta cggtaaactg 300 
cccacttggc agtacatcaa gtgtatcata tgccaagtac gccccctatt gacgtcaatg 360 
acggtaaatg gcccgcctgg cattatgccc agtacatgac cttatgggac tttcctactt 420 
ggcagtacat ctacgtatta gtcatcgcta ttaccatggt gatgcggttt tggcagtaca 480 
tcaatgggcg tggatagcgg tttgactcac ggggatttcc aagtctccac cccattgacg 540 
tcaatgggag tttgttttgg caccaaaatc aacgggactt tccaaaatgt cgtaacaact 600 
ccgccccatt gacgcaaatg ggcggtaggc gtgtacggtg ggaggtctat ataagcagag 660 
ctcgtttagt gaaccgtcag atcgcctgga gacgccatcc acgctgtttt gacctccata 720 
gaagacaccg ggaccgatcc agcctcccct cgaagcttac atgtggtacc gagctcggat 780 
cctgagaact tcagggtgag tctatgggac ccttgatgtt ttctttcccc ttcttttcta 840 
tggttaagtt catgtcatag gaaggggaga agtaacaggg tacacatatt gaccaaatca 900 
gggtaatttt gcatttgtaa ttttaaaaaa tgctttcttc ttttaatata cttttttgtt 960 
tatcttattt ctaatacttt ccctaatctc tttctttcag ggcaataatg atacaatgta 1020 
tcatgcctct ttgcaccatt ctaaagaata acagtgataa tttctgggtt aaggcaatag 1080 
caatatttct gcatataaat atttctgcat ataaattgta actgatgtaa gaggtttcat 1140 
attgctaata gcagctacaa tccagctacc attctgcttt tattttatgg ttgggataag 1200 
gctggattat tctgagtcca agctaggccc ttttgctaat catgttcata cctcttatct 1260 
tcctcccaca gctcctgggc aacgtgctgg tctgtgtgct ggcccatcac tttggcaaag 1320 
cacgtgagat ctgaattctg acactatgaa gtgccttttg tacttagcct ttttattcat 1380 
tggggtgaat tgcaagttca ccatagtttt tccacacaac caaaaaggaa actggaaaaa 1440 
tgttccttct aattaccatt attgcccgtc aagctcagat ttaaattggc ataatgactt 1500 
aataggcaca gccttacaag tcaaaatgcc caagagtcac aaggctattc aagcagacgg 1560 
ttggatgtgt catgcttcca aatgggtcac tacttgtgat ttccgctggt atggaccgaa 1620 
gtatataaca cattccatcc gatccttcac tccatctgta gaacaatgca aggaaagcat 1680 
tgaacaaacg aaacaaggaa cttggctgaa tccaggcttc cctcctcaaa gttgtggata 1740 
tgcaactgtg acggatgccg aagcagtgat tgtccaggtg actcctcacc atgtgctggt 1800 
tgatgaatac acaggagaat gggttgattc acagttcatc aacggaaaat gcagcaatta 1860 
catatgcccc actgtccata actctacaac ctggcattct gactataagg tcaaagggct 1920 
atgtgattct aacctcattt ccatggacat caccttcttc tcagaggacg gagagctatc 1980 
atccctggga aaggagggca cagggttcag aagtaactac tttgcttatg aaactggagg 2040 
caaggcctgc aaaatgcaat actgcaagca ttggggagtc agactcccat caggtgtctg 2100 
gttcgagatg gctgataagg atctctttgc tgcagccaga ttccctgaat gcccagaagg 2160 
gtcaagtatc tctgctccat ctcagacctc agtggatgta agtctaattc aggacgttga 2220 
gaggatcttg gattattccc tctgccaaga aacctggagc aaaatcagag cgggtcttcc 2280 
aatctctcca gtggatctca gctatcttgc tcctaaaaac ccaggaaccg gtcctgcttt 2340 
caccataatc aatggtaccc taaaatactt tgagaccaga tacatcagag tcgatattgc 2400 
tgctccaatc ctctcaagaa tggtcggaat gatcagtgga actaccacag aaagggaact 2460 
gtgggatgac tgggcaccat atgaagacgt ggaaattgga cccaatggag ttctgaggac 2520 
cagttcagga tataagtttc ctttatacat gattggacat ggtatgttgg actccgatct 2580 
tcatcttagc tcaaaggctc aggtgttcga acatcctcac attcaagacg ctgcttcgca 2640 
acttcctgat gatgagagtt tattttttgg tgatactggg ctatccaaaa atccaatcga 2700 
gcttgtagaa ggttggttca gtagttggaa aagctctatt gcctcttttt tctttatcat 2760 
agggttaatc attggactat tcttggttct ccgagttggt atccatcttt gcattaaatt 2820 
aaagcacacc aagaaaagac agatttatac agacatagag atgaaccgac ttggaaagta 2880 
actcaaatcc tgcacaacag attcttcatg tttggaccaa atcaacttgt gataccatgc 2940 
tcaaagaggc ctcaattata tttgagtttt taatttttat gaaaaaaaaa aaaaaaaacg 3000 
gaattcaccc caccagtgca ggctgcctat cagaaagtgg tggctggtgt ggctaatgcc 3060 
ctggcccaca agtatcacta agctcgcttt cttgctgtcc aatttctatt aaaggttcct 3120 
ttgttcccta agtccaacta ctaaactggg ggatattatg aagggccttg agcatctgga 3180 
ttctgcctaa taaaaaacat ttattttcat tgcaatgatg tatttaaatt atttctgaat 3240 
attttactaa aaagggaatg tgggaggtca gtgcatttaa aacataaaga aatgaagagc 3300 
tagttcaaac cttgggaaaa tacactatat cttaaactcc atgaaagaag gtgaggctgc 3360 
aaacagctaa tgcacattgg caacagcccc tgatgcctat gccttattca tccctcagaa 3420 
aaggattcaa gtagaggctt gatttggagg ttaaagtttt gctatgctgt attttacatt 3480 
acttattgtt ttagctgtcc tcatgaatgt cttttcacta cccatttgct tatcctgcat 3540 
ctctcagcct tgactccact cagttctctt gcttagagat accacctttc ccctgaagtg 3600 
ttccttccat gttttacggc gagatggttt ctcctcgcct ggccactcag ccttagttgt 3660 
ctctgttgtc ttatagaggt ctacttgaag aaggaaaaac agggggcatg gtttgactgt 3720 
cctgtgagcc cttcttccct gcctccccca ctcacagtga cccggaatcc ctcgacatgg 3780 
cagtctagca ctagtgcggc cgcagatctg cttcctcgct cactgactcg ctgcgctcgg 3840 
tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 3900 
aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 3960 
gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 4020 
aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 4080 
ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 4140 
tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 4200 
tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 4260 
ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 4320 
tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 4380 
ctacagagtt cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta 4440 
tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 4500 
aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 4560 
aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 4620 
aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 4680 
ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 4740 
acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 4800 
ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 4860 
gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 4920 
taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca 4980 
tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 5040 
gcaacgttgt tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 5100 
cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 5160 
aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 5220 
cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 5280 
tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 5340 
gttgctcttg cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag 5400 
tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 5460 
gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 5520 
ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 5580 
cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 5640 
agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 5700 
gggttccgcg cacatttccc cgaaaagtgc cacctgacgg gatcccctga gggggccccc 5760 
atgggctaga ggatccggcc tcggcctctg cataaataaa aaaaattagt cagccatgag 5820 
c 5821 



<210> 52 
<211> 1986 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 52 

tcctcctctg ctcgcccaat ccttccaacc ccctatggtg gtatggctga cacagaaaat 60 
gtctgctcct gtatgggaca tttgcccctc ttctccaaat ataagacagg atgaggccta 120 

gcttttgctg ctccaaagtt ttaaaagaac acattgcacg gcatttaggg actctaaagg 180 

gtggaggagg aatgagggaa ttgcatcatg ccaaggctgg tcctcatcca tcactgcttc 240 

cagggcccag agtggcttcc aggaggtatt cttacaaagg aagcccgatc tgtagctaac 300 

actcagagcc cattttcctg cgttaacccc tcccgacctc atatacagga gtaacatgat 360 

cagtgacctg ggggagctgg ccaaactgcg ggacctgccc aagctgaggg ccttggtgct 420 

gctggacaac ccctgtgccg atgagactga ctaccgccag gaggccctgg tgcagatggc 480 

acacctagag cgcctagaca aagagtacta tgaggacgag gaccgggcag aagctgagga 540 
gatccgacag aggctgaagg aggaacagga gcaagaactc gacccggacc aagacatgga 600 

accgtacctc ccgccaactt agtggctcct ctagcctgca gggacagtaa aggtgatggc 660 

aggaaggcag cccccggagg tcaaaggctg ggcacgcggg aggagaggcc agagtcagag 720 

gctgcgggta tctcagatat gaaggaaaga tgagagaggc tcaggaagag gtaagaaaag 780 

acacaagaga ccagagaagg gagaagaatt agagagggag gcagaggacc gctgtctcta 840 

cagacatagc tggtagagac tgggaggaag ggatgaaccc tgagcgcatg aagggaagga 900 

ggtggctggt ggtatatgga ggatgtagct gggccaggga aaagatcctg cactaaaaat 960 

ctgaagctaa aaataacagg acacggggtg gagaggcgaa aggagggcag attgaggcag 1020 

agagactgag aggcctgggg atgtgggcat tccggtaggg cacacagttc acttgtcttc 1080 

tctttttcca ggaggccara gatgctgacc tcaagaactc ataatacccc agtggggacc 1140 

accgcattca tagccctgtt aca^gaagtg ggagatgttc ctttttgtcc cagactggaa 1200 

atccattaca tcccgaggct caggttctgt ggtggtcatc tctgtgtggc ttgttctgtg 1260 

ggcctaccta aagtcctaag cacagctctc aagcagatcc gaggcgacta agatgctagt 1320 

aggggttgtc tggagagaag agccgaggag gtgggctgtg atggatcagt tcagctttca 1380 

aataaaaagg cgtttttata ttctgtgtcg agttcgtgaa cccctgtggt gggcttctcc 1440 

atctgtctgg gttagtacct gccactatac tggaataagg ggacgcctgc ttccctcgag 1500 

ttggctggac aaggttatga gcatccgtgf acttatgggg ttgccagctt ggtcctggat 1560 

cgcccgggcc cttcccccac ccgttcggtt ccccaccacc acccgcgctc gtacgtgcgt 1620 

ctccgcctgc agctcttgac tcatcggggc ccccgggtca catgcgctcg ctcggctcta 1680 

taggcgccgc cccctgccca ccccccgccc gcgctgggag ccgcagccgc cgccactcct 1740 

gctctctctg cgccgccgcc gtcaccaccg ccaccgccac cggctgagtc tgcagtcctc 1800 

gaggaactga aaaaccagaa agttaactgg taagtttagt ctttttgtct tttatttcag 1860 

gtcccggatc cggtggtggt gcaaatcaaa gaactgctcc tcagtggatg ttgcctttac 1920 

ttctaggcct gtacggaagt gtkacttctg ctctaaaagc tgcggaattg tacccgcggc 1980 

caagct 1986 

<210> 53 

<211> 1610 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 53 

cggatgagcc tctgtgaact actaaggtgg gagggggcta tacgcagagg agaatgtcag 60 
atgctcagct cggtcccctc cgcctgacgc tcctctctgt ctcagccagg actggtttct 120 
gtaagaaaca gcaggagctg tggcagcggc gaaaggaagc ggctgaggcg cttggaaccc 180 
gaaaagtctc ggtgctcctg gctacctcgc acagcgtgcc cgcccggccg tcagtaccat 240 
ggacagcagc gctgccccca cgaacgccag caattgcact gatgccttgg cgtactcaag 300 
ttgctcccca gcacccagcc ccggttcctg ggtcaacttg tcccacttag atggcaacct 360 
gtccgaccca tgcggtccga accgcaccga cctgggcggg agagacagcc tgtgccctcc 420 
gaccggcagt ccctccatga tcacggccat cacgatcatg gccctctact ccatcgtgtg 480 
cgtggtgggg ctcttcggaa acttcctggt catgtatgtg attgtcagat acaccaagat 540 
gaagactgcc accaacatct acattttcaa ccttgctctg gcagatgcct tagccaccag 600 
taccctgccc ttccagagtg tgaattacct aatgggaaca tggccatttg gaaccatcct 660 
ttgcaagata gtgatctcca tagattacta taacatgttc accagcatat tcaccctctg 720 
caccatgagt gttgatcgat acattgcagt ctgccaccct gtcaaggcct tagatttccg 780 
tactccccga aatgccaaaa ttatcaatgt ctgcaactgg atcctctctt cagccattgg 840 
tcttcctgta atgttcatgg ctacaacaaa atacaggcaa ggttccatag attgtacact 900 
aacattctct catccaacct ggtactggga aaacctgctg aagatctgtg ttttcatctt 960 
cgccttcatt atgccagtgc tcatcattac cgtgtgctat ggactgatga tcttgcgcct 1020 
caagagtgtc cgcatgctct ctggctccaa agaaaaggac aggaatcttc gaaggatcac 1080 
caggatggtg ctggtggtgg tggctgtgtt catcgtctgc tggactccca ttcacattta 1140 
cgtcatcatt aaagccttgg ttacaatccc agaaactacg ttccagactg tttcttggca 1200 
cttctgcatt gctctaggtt acacaaacag ctgcctcaac ccagtccttt atgcatttct 1260 
ggatgaaaac ttcaaacgat gcttcagaga gttctgtatc ccaacctctt ccaacattga 1320 
gcaacaaaac tccactcgaa ttcgtcagaa cactagagac cacccctcca cggccaatac 1380 
agtggataga actaatcatc agctagaaaa tctggaagca gaaactgctc cgttgcccta 1440 
acagggtctc atgccattcc gaccttcacc aagcttagaa gccaccatgt atgtggaagc 1500 
aggttgcttifc aagaatgtgt aggaggctct aattctctag gaaagtgcct gcttttaggt 1560 
catccaacct ctttcctctc tggccactct gctctgcaca ttagaggccg 1610 

<210> 54 

<211> 1536 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: /note = 
synthetic construct 

<400> 54 

atgaagtgcc ttttgtactt agccttttta ttcattgggg tgaattgcaa gttcaccata 60 
gtttttccac acaaccaaaa aggaaactgg aaaaatgttc cttctaatta ccattattgc 120 
ccgtcaagct cagatttaaa ttggcataat gacttaatag gcacagcctt acaagtcaaa 180 
atgcccaaga gtcacaaggc tattcaagca gacggttgga tgtgtcatgc ttccaaatgg 240 
gtcactactt gtgatttccg ctggtatgga ccgaagtata taacacattc catccgatcc 300 
ttcactccat ctgtagaaca atgcaaggaa agcattgaac aaacgaaaca aggaacttgg 360 
ctgaatccag gcttccctcc tcaaagttgt ggatatgcaa ctgtgacgga tgccgaagca 420 
gtgattgtcc aggtgactcc tcaccatgtg ctggttgatg aatacacagg agaatgggtt 480 
gattcacagt tcatcaacgg aaaatgcagc aattacatat gccccactgt ccataactct 540 
acaacctggc attctgacta taaggtcaaa gggctatgtg attctaacct catttccatg 600 
gacatcacct tcttctcaga ggacggagag ctatcatccc tgggaaagga gggcacaggg 660 
ttcagaagta actactttgc ttatgaaact ggaggcaagg cctgcaaaat gcaatactgc 720 
aagcattggg gagtcagact cccatcaggt gtctggttcg agatggctga taaggatctc 780 
tttgctgcag ccagattccc tgaatgccca gaagggtcaa gtatctctgc tccatctcag 840 
acctcagtgg atgtaagtct aattcaggac gttgagagga tcttggatta ttccctctgc 900 
caagaaacct ggagcaaaat cagagcgggt cttccaatct ctccagtgga tctcagctat 960 
cttgctccta aaaacccagg aaccggtcct gctttcacca taatcaatgg taccctaaaa 1020 
tactttgaga ccagatacat cagagtcgat attgctgctc caatcctctc aagaatggtc 1080 
ggaatgatca gtggaactac cacagaaagg gaactgtggg atgactgggc accatatgaa 1140 
gacgtggaaa ttggacccaa tggagttctg aggaccagtt caggatataa gtttccttta 1200 
tacatgattg gacatggtat gttggactcc gatcttcatc ttagctcaaa ggctcaggtg 1260 
ttcgaacatc ctcacattca agacgctgct tcgcaacttc ctgatgatga gagtttattt 1320 
tttggtgata ctgggctatc caaaaatcca atcgagcttg tagaaggttg gttcagtagt 1380 
tggaaaagct ctattgcctc ttttttcttt atcatagggt taatcattgg actattcttg 1440 
gttctccgag ttggtatcca tctttgcatt aaattaaagc acaccaagaa aagacagatt 1500 
tatacagaca tagagatgaa ccgacttgga aagtaa 1536 